<86>Feb 9 05:31:31 userdel[3653992]: delete user 'rooter' <86>Feb 9 05:31:31 userdel[3653992]: removed group 'rooter' owned by 'rooter' <86>Feb 9 05:31:31 userdel[3653992]: removed shadow group 'rooter' owned by 'rooter' <86>Feb 9 05:31:31 groupadd[3653999]: group added to /etc/group: name=rooter, GID=1211 <86>Feb 9 05:31:31 groupadd[3653999]: group added to /etc/gshadow: name=rooter <86>Feb 9 05:31:31 groupadd[3653999]: new group: name=rooter, GID=1211 <86>Feb 9 05:31:31 useradd[3654005]: new user: name=rooter, UID=1211, GID=1211, home=/root, shell=/bin/bash, from=none <86>Feb 9 05:31:31 userdel[3654015]: delete user 'builder' <86>Feb 9 05:31:31 userdel[3654015]: removed group 'builder' owned by 'builder' <86>Feb 9 05:31:31 userdel[3654015]: removed shadow group 'builder' owned by 'builder' <86>Feb 9 05:31:31 groupadd[3654022]: group added to /etc/group: name=builder, GID=1212 <86>Feb 9 05:31:31 groupadd[3654022]: group added to /etc/gshadow: name=builder <86>Feb 9 05:31:31 groupadd[3654022]: new group: name=builder, GID=1212 <86>Feb 9 05:31:31 useradd[3654028]: new user: name=builder, UID=1212, GID=1212, home=/usr/src, shell=/bin/bash, from=none /usr/src/in/srpm/rccl-2.18.6-alt0.1.src.rpm: bad symbols in the license tag: // <13>Feb 9 05:31:37 rpmi: libidn2-2.3.7-alt1 sisyphus+339505.100.1.2 1706718968 installed <13>Feb 9 05:31:37 rpmi: libnettle8-3.10.1-alt1 sisyphus+372008.100.1.1 1738078259 installed <13>Feb 9 05:31:37 rpmi: libp11-kit-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Feb 9 05:31:37 rpmi: libtasn1-4.19.0-alt3 sisyphus+327816.100.1.1 1692802615 installed <13>Feb 9 05:31:37 rpmi: libhogweed6-3.10.1-alt1 sisyphus+372008.100.1.1 1738078259 installed <13>Feb 9 05:31:37 rpmi: libgnutls30-3.8.8-alt2 sisyphus+364832.100.1.1 1734007749 installed <13>Feb 9 05:31:37 rpmi: libngtcp2.16-1.10.0-alt1 sisyphus+366376.200.1.1 1735020753 installed <13>Feb 9 05:31:37 rpmi: libngtcp2_crypto_gnutls8-1.10.0-alt1 sisyphus+366376.200.1.1 1735020753 
installed <13>Feb 9 05:31:37 rpmi: cmake-modules-3.31.5-alt1 sisyphus+371742.100.1.1 1737807519 installed <13>Feb 9 05:31:37 rpmi: libuv-1.49.2-alt1 sisyphus+369779.100.1.1 1737060735 installed <13>Feb 9 05:31:37 rpmi: librhash-1.3.5-alt3 sisyphus+286141.40.2.1 1632982456 installed <13>Feb 9 05:31:37 rpmi: libjsoncpp24-1.9.4-alt2 sisyphus+346331.200.2.1 1716448551 installed <13>Feb 9 05:31:37 rpmi: libexpat-2.6.4-alt1 sisyphus+365521.100.1.1 1734700243 installed <13>Feb 9 05:31:37 rpmi: publicsuffix-list-dafsa-20250131-alt1 sisyphus+373297.100.1.1 1738767834 installed <13>Feb 9 05:31:37 rpmi: libpsl-0.21.5-alt1 sisyphus+338474.100.1.1 1705684769 installed <13>Feb 9 05:31:37 rpmi: libnghttp3.9-1.7.0-alt1 sisyphus+366376.100.1.1 1735020696 installed <13>Feb 9 05:31:37 rpmi: libnghttp2-1.64.0-alt1 sisyphus+363795.200.2.1 1733118555 installed <13>Feb 9 05:31:37 rpmi: openldap-common-2.6.9-alt2 sisyphus+367501.300.4.1 1735841751 installed <13>Feb 9 05:31:37 rpmi: libntlm-1.5-alt1 sisyphus+278100.3300.1.1 1626058899 installed <13>Feb 9 05:31:37 rpmi: libidn-1.37-alt2 sisyphus+300849.100.1.1 1653769687 installed <13>Feb 9 05:31:37 rpmi: libverto-0.3.2-alt1_1 sisyphus+321176.2200.10.2 1684803947 installed <13>Feb 9 05:31:37 rpmi: liblmdb-0.9.33-alt1 sisyphus+360625.100.1.1 1729819640 installed <13>Feb 9 05:31:37 rpmi: libkeyutils-1.6.3-alt1 sisyphus+346336.200.2.2 1716472658 installed <13>Feb 9 05:31:37 rpmi: libcom_err-1.47.1.0.10.ad56-alt2 sisyphus+363497.200.3.1 1732729908 installed <13>Feb 9 05:31:37 rpmi: libbrotlicommon-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Feb 9 05:31:37 rpmi: libbrotlidec-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Feb 9 05:31:37 rpmi: openssl-config-3.2.0-alt1 sisyphus+366659.140.4.1 1736956949 installed <13>Feb 9 05:31:37 rpmi: rpm-macros-cmake-3.29.1-alt1 sisyphus+344518.300.3.1 1712379787 installed <13>Feb 9 05:31:37 rpmi: rpm-macros-alternatives-0.5.3-alt1 sisyphus+371878.100.1.1 1737988822 installed 
<13>Feb 9 05:31:37 rpmi: alternatives-0.5.3-alt1 sisyphus+371878.100.1.1 1737988822 installed <13>Feb 9 05:31:37 rpmi: ca-certificates-2024.12.10-alt1 sisyphus+364633.200.3.1 1733918603 installed <13>Feb 9 05:31:37 rpmi: ca-trust-0.2.0-alt1 sisyphus+344843.100.1.1 1712743326 installed <13>Feb 9 05:31:37 rpmi: p11-kit-trust-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Feb 9 05:31:37 rpmi: libcrypto3-3.3.2-alt1 sisyphus+366659.40.4.1 1736956900 installed <13>Feb 9 05:31:37 rpmi: libssl3-3.3.2-alt1 sisyphus+366659.40.4.1 1736956900 installed <86>Feb 9 05:31:37 groupadd[3655777]: group added to /etc/group: name=_keytab, GID=999 <86>Feb 9 05:31:37 groupadd[3655777]: group added to /etc/gshadow: name=_keytab <86>Feb 9 05:31:37 groupadd[3655777]: new group: name=_keytab, GID=999 <13>Feb 9 05:31:37 rpmi: libkrb5-1.21.3-alt2 sisyphus+351857.100.1.1 1719735141 installed <13>Feb 9 05:31:37 rpmi: libgsasl18-2.2.1-alt2 sisyphus+359713.200.2.1 1728905430 installed <86>Feb 9 05:31:37 groupadd[3655784]: group added to /etc/group: name=sasl, GID=998 <86>Feb 9 05:31:37 groupadd[3655784]: group added to /etc/gshadow: name=sasl <86>Feb 9 05:31:37 groupadd[3655784]: new group: name=sasl, GID=998 <13>Feb 9 05:31:37 rpmi: libsasl2-3-2.1.28-alt2.1 sisyphus+367419.100.1.1 1735482560 installed <13>Feb 9 05:31:37 rpmi: libldap2-2.6.9-alt2 sisyphus+367501.300.4.1 1735841751 installed <13>Feb 9 05:31:37 rpmi: libarchive13-3.7.5-alt2 sisyphus+358189.100.1.1 1727162763 installed <13>Feb 9 05:31:37 rpmi: libssh2-1.11.0-alt2 sisyphus+339356.100.1.1 1706593137 installed <13>Feb 9 05:31:37 rpmi: libcurl-8.12.0-alt1 sisyphus+373228.100.1.1 1738746008 installed <13>Feb 9 05:31:38 rpmi: cmake-3.31.5-alt1 sisyphus+371742.100.1.1 1737807519 installed <13>Feb 9 05:31:59 rpmi: llvm-common-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Feb 9 05:31:59 rpmi: llvm-rocm-filesystem-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:31:59 rpmi: 
libnuma-2.0.19-alt1 sisyphus+363830.100.1.1 1733131852 installed <13>Feb 9 05:31:59 rpmi: rocm-device-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:31:59 rpmi: llvm18.1-filesystem-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:00 rpmi: clang18.1-support-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:00 rpmi: llvm18.1-polly-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:00 rpmi: gcc-c++-common-1.4.28-alt1 sisyphus+348678.100.1.1 1716396142 installed <13>Feb 9 05:32:00 rpmi: libstdc++14-devel-14.2.1-alt1 sisyphus+360995.100.1.1 1730131018 installed <13>Feb 9 05:32:00 rpmi: librocm-smi1-6.1.2-alt0.3 sisyphus+362389.100.1.1 1731447319 installed <13>Feb 9 05:32:00 rpmi: libpciaccess-1:0.18.1-alt1 sisyphus+343583.300.1.1 1711440789 installed <13>Feb 9 05:32:01 rpmi: libdrm-1:2.4.124-alt1 sisyphus+364215.100.1.1 1733469813 installed <13>Feb 9 05:32:01 rpmi: libhsakmt1-6.1.2-alt0.1 sisyphus+352247.600.5.1 1720254766 installed <13>Feb 9 05:32:01 rpmi: libhsa-runtime1-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Feb 9 05:32:01 rpmi: libpci-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Feb 9 05:32:01 rpmi: pciids-20250131-alt1 sisyphus+372278.100.1.1 1738351325 installed <13>Feb 9 05:32:01 rpmi: pciutils-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Feb 9 05:32:01 rpmi: libmpdec3-2.5.1-alt3 sisyphus+314490.500.5.1 1675432004 installed <13>Feb 9 05:32:01 rpmi: libgdbm-1.8.3-alt10 sisyphus+346222.200.3.2 1716468404 installed <13>Feb 9 05:32:01 rpmi: libb2-0.98.1-alt1_1 sisyphus+291614.100.1.1 1638962877 installed <13>Feb 9 05:32:01 rpmi: python3-3.12.8-alt1 sisyphus+364336.100.1.1 1733526854 installed <13>Feb 9 05:32:02 rpmi: python3-base-3.12.8-alt1 sisyphus+364336.100.1.1 1733526854 installed <13>Feb 9 05:32:02 rpmi: clang-rocm-libs-support-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:06 
rpmi: clang-rocm-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:06 rpmi: rocminfo-6.1.2-alt0.1 sisyphus+352247.1700.9.1 1720269882 installed <13>Feb 9 05:32:06 rpmi: libedit3-3.1.20230828-alt1 sisyphus+330914.200.3.1 1696922743 installed <13>Feb 9 05:32:06 rpmi: llvm18.1-gold-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:08 rpmi: llvm18.1-libs-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:09 rpmi: libclang-cpp18-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:09 rpmi: clang18.1-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:09 rpmi: clang-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Feb 9 05:32:12 rpmi: clang-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:14 rpmi: llvm18.1-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:14 rpmi: llvm-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Feb 9 05:32:33 rpmi: llvm-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:33 rpmi: libclang18-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:34 rpmi: clang18.1-devel-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:34 rpmi: clang-devel-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Feb 9 05:32:35 rpmi: clang18.1-tools-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:35 rpmi: clang-tools-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Feb 9 05:32:45 rpmi: clang-rocm-tools-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:45 rpmi: lld18.1-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:32:45 rpmi: lld-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Feb 9 05:32:46 rpmi: lld-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:49 
rpmi: libamd_comgr2-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:50 rpmi: llvm-rocm-gold-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:51 rpmi: llvm-rocm-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:52 rpmi: hip-runtime-amd-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Feb 9 05:32:52 rpmi: hipcc-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:32:55 rpmi: mlir18.1-tools-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:33:22 rpmi: llvm18.1-devel-18.1.8-alt0.4 sisyphus+364551.100.1.1 1733763186 installed <13>Feb 9 05:33:22 rpmi: llvm-devel-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Feb 9 05:33:41 rpmi: llvm-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:33:41 rpmi: hip-devel-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Feb 9 05:33:41 rpmi: rocm-comgr-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:33:56 rpmi: clang-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Feb 9 05:33:57 rpmi: hipify-clang-6.1.2-alt0.1 sisyphus+352428.200.1.1 1720459887 installed <13>Feb 9 05:33:57 rpmi: hsa-rocr-devel-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Feb 9 05:33:57 rpmi: librocm-smi-devel-6.1.2-alt0.3 sisyphus+362389.100.1.1 1731447319 installed <13>Feb 9 05:33:57 rpmi: libstdc++-devel-14-alt1 sisyphus+360995.300.1.1 1730139222 installed <13>Feb 9 05:33:57 rpmi: rocm-cmake-6.1.2-alt0.1 sisyphus+352247.100.1.1 1720180839 installed Building target platforms: x86_64 Building for target x86_64 Wrote: /usr/src/in/nosrpm/rccl-2.18.6-alt0.1.nosrc.rpm (w1.gzdio) Installing rccl-2.18.6-alt0.1.src.rpm Building target platforms: x86_64 Building for target x86_64 Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.5387 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd 
/usr/src/RPM/BUILD + rm -rf rccl-2.18.6 + echo 'Source #0 (rccl-2.18.6.tar):' Source #0 (rccl-2.18.6.tar): + /bin/tar -xf /usr/src/RPM/SOURCES/rccl-2.18.6.tar + cd rccl-2.18.6 + /bin/chmod -c -Rf u+rwX,go-w . + subst 's,cat ${ROCM_PATH}/.info/version,echo 6.1.2,' CMakeLists.txt + exit 0 Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.5387 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd rccl-2.18.6 + export ALTWRAP_LLVM_VERSION=rocm + ALTWRAP_LLVM_VERSION=rocm + mkdir -p x86_64-alt-linux + cmake -DCMAKE_SKIP_INSTALL_RPATH:BOOL=yes '-DCMAKE_C_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_CXX_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_Fortran_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' -DCMAKE_INSTALL_PREFIX=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_DESTINATION=lib64 -DLIB_SUFFIX=64 -S . -B x86_64-alt-linux -Wno-dev -DROCM_PATH=/usr -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DCMAKE_INSTALL_LIBDIR=lib64 -DENABLE_MSCCL_KERNEL=ON -- The CXX compiler identification is Clang 17.0.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/clang++ - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- Checking for ROCm support for GPU targets: -- Performing Test COMPILER_HAS_TARGET_ID_gfx803 -- Performing Test COMPILER_HAS_TARGET_ID_gfx803 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off - Success -- Performing Test 
COMPILER_HAS_TARGET_ID_gfx908_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx908_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx940 -- Performing Test COMPILER_HAS_TARGET_ID_gfx940 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx941 -- Performing Test COMPILER_HAS_TARGET_ID_gfx941 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success -- Compiling for gfx803;gfx900:xnack-;gfx906:xnack-;gfx908:xnack-;gfx90a:xnack-;gfx90a:xnack+;gfx940;gfx941;gfx942;gfx1030;gfx1100;gfx1101;gfx1102 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- ROCM_PATH found: /usr -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc -- hipcc version: 6.1.40093 -- ROCm version: 6.1.2 ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake 
target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:87 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. 
ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:88 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipEventDisableSystemFence -- Looking for hipEventDisableSystemFence - not found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. 
ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:99 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:73 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. 
ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:87 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. 
ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:88 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipDeviceMallocUncached -- Looking for hipDeviceMallocUncached - not found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. 
ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:99 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:73 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.h -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp -- HIP_UNCACHED_MEMORY enabled -- RCCL LL128 protocol enabled -- Building shared RCCL library -- rocm-cmake: Set license file to /usr/src/RPM/BUILD/rccl-2.18.6/LICENSE.txt. 
-- Configuring done (23.6s) -- Generating done (0.1s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_C_COMPILER CMAKE_C_FLAGS CMAKE_Fortran_FLAGS LIB_DESTINATION LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux + cmake --build x86_64-alt-linux --verbose --parallel 8 Change Dir: '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j8 gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -S/usr/src/RPM/BUILD/rccl-2.18.6 -B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux --check-build-system CMakeFiles/Makefile.cmake 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux//CMakeFiles/progress.marks gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/Makefile2 all /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /usr/src/RPM/BUILD/rccl-2.18.6/cmake/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Built target git_version_check gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/collectives/all_reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/all_gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/collectives/all_to_allv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_allv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/collectives/all_to_all.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_all.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/broadcast.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/broadcast.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/channel.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/channel.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/transport/shm.cc -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/shm.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/bootstrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/bootstrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/device/broadcast.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/broadcast.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/alltoall_pivot.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/alltoall_pivot.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/device/all_gather.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_gather.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/common_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/onerank_reduce.cu -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/onerank_reduce.cu -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: 
Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/device/all_reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/common.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/msccl_kernel_impl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/msccl_kernel_impl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/primitives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h 
mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/primitives.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/op128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/op128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_ll128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_ll.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_simple.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_simple.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/reduce_scatter.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_scatter.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/reduce_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/sendrecv.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/sendrecv.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/msccl.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && 
/usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/msccl.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce_scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce_scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/collectives/sendrecv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/sendrecv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/debug.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/debug.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/graph/connect.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/connect.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rings.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rings.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/paths.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/paths.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/rome_models.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/enqueue.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/enqueue.cc -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rome_models.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/search.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/search.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/trees.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl 
-quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/trees.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/tuning.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/tuning.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/graph/xml.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/xml.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/group.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/group.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/BfdBacktrace.hpp -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/BfdBacktrace.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/align.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/align.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/alloc.h -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/alloc.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/archinfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/archinfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/argcheck.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/argcheck.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/bootstrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/bootstrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: 
Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/channel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/channel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/coll_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/coll_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/checks.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/checks.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/collectives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/collectives.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/comm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/comm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/cpuset.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/cpuset.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/core.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/core.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/debug.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl 
-quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/debug.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/enqueue.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/enqueue.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/devcomm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/devcomm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/git_version.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/git_version.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/gdrwrap.h -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/gdrwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/group.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/group.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvsymbols.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvsymbols.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: 
Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/graph.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/graph.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/info.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/info.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/ibvcore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvcore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_lifecycle.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_lifecycle.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ipcsocket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ipcsocket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/msccl/msccl_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_parser.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_parser.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_scheduler.h -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_scheduler.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_setup.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_setup.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_status.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_status.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/nccl_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nccl_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_event.h -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_event.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvmlwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvmlwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h gmake[2]: Leaving 
directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCuda.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCudaRt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtOpenCL.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExt.h -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtSync.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtPayload.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 24%] Hipifying src/include/nvtx3/nvtx3.hpp -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtx3.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying 
src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] 
Hipifying src/include/nvtx_stub.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx_stub.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/p2p.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/p2p.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/profiler.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/profiler.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_bfloat16.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_bfloat16.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_vars.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_vars.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/param.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/param.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/proxy.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/proxy.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/rocmwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocmwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/rocm_smi_wrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocm_smi_wrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/shm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/shm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying 
src/include/signals.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/signals.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/timer.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/timer.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/socket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/socket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/strongstream.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/strongstream.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/transport.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/transport.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/trees.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/trees.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/archinfo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/archinfo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/include/utils.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/utils.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/argcheck.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/argcheck.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvsymbols.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvsymbols.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ipcsocket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && 
/usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ipcsocket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/init.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/init.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_lifecycle.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_setup.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_setup.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_parser.cc -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_parser.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/nvmlwrap_stub.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/nvmlwrap_stub.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/npkit.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/npkit.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_status.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_status.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/param.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/param.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/profiler.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/profiler.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocm_smi_wrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocm_smi_wrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/shmutils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/shmutils.cc -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/signals.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/signals.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocmwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocmwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/utils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/utils.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/strongstream.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/strongstream.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/proxy.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/proxy.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/transport.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl 
-quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/transport/coll_net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/coll_net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/nvls.cc -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/nvls.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 41%] Hipifying src/transport/p2p.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/p2p.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_ib.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_ib.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f 
CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
1 warning generated when compiling for gfx1100.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
1 warning generated when compiling for gfx941.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
1 warning generated when compiling for gfx803.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 
warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 
40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 
| static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx942. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 
'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ 3 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ 3 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: 
warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | 
static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' 
[-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 
'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: 
warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct 
ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' 
[-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx1101. 5 warnings generated when compiling for gfx803. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx1100. 5 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: 
warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct 
ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx942. 5 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -c 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * 
ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when 
compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when 
compiling for gfx1101. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when 
compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 
'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long 
n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused 
variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | 
^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: 
warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ 3 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ 3 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for 
gfx90a. 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 
'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' 
[-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. 4 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d 
-o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | 
gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static 
gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static 
long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: 
unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static 
ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused 
variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling 
for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning 
generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | 
NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings 
generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 3 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' 
[-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), 
peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx803. 
4 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 
50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 4 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated 
when compiling for gfx941. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ 3 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | 
^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx940. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL 
-DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx940. 
1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' 
[-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: 
unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t 
ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t 
ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' 
[-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct 
ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | 
int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: 
warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t 
xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static 
ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused 
function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* 
attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct 
ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 
'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ 20 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ 20 warnings generated when compiling for gfx1100. 20 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 
'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, 
const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static 
ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, 
const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' 
[-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { 
| ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused 
function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' 
[-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx803. 20 warnings generated when compiling for gfx940. 20 warnings generated when compiling for gfx908. 20 warnings generated when compiling for gfx90a. 20 warnings generated when compiling for gfx906. 20 warnings generated when compiling for gfx1030. 20 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused 
function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx942. 20 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused 
variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 
1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | 
static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' 
[-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* 
node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: 
warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused 
function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct 
ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: 
warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 
'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ 28 warnings generated when compiling for gfx940. 28 warnings generated when compiling for gfx900. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ 28 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, 
const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; 
| ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* 
attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: 
warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct 
ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t 
xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 
'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, 
const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' 
[-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct 
ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t 
xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | 
^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: 
warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, 
struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1100. 28 warnings generated when compiling for gfx941. 28 warnings generated when compiling for gfx1101. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int 
gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused 
function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct 
ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx942. 28 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: 
unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 
'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 
'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t 
xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, 
const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { 
| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx900. 10 warnings generated when compiling for gfx803. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, 
const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused 
function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' 
[-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' 
[-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, 
const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const 
char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx940. 
10 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t 
xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, 
const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 
'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* 
comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* 
collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static 
ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct 
ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' 
[-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ 23 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct 
ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: 
unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 
'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** 
str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return
ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static 
ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct 
ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' 
[-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 
22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct 
ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) 
{ | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t 
collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { 
NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 
'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx906. 23 warnings generated when compiling for gfx1100. 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx1102. 23 warnings generated when compiling for gfx940. 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx1101. 23 warnings generated when compiling for gfx900. 23 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 
'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** 
str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx942. 23 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 
-DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused 
variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | 
double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int 
dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused 
variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: 
warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx908. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' 
[-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 
'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | 
^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the 
same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' 
[-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. 9 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t 
ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx941. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx900. 2 warnings generated when compiling for gfx803. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx940. 2 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- 
--offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | 
^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1102. 
1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, 
xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' 
[-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static 
ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | 
^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static 
ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused 
function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use 
occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 
'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 
| static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t 
kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, 
const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized 
whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx942. 8 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 
| gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | 
gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct 
ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return 
ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float 
ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | 
^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | 
^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush'
[-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { 
NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { 
NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; 
} | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | 
static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct 
ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { 
NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { 
NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return 
comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t 
ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1100. 28 warnings generated when compiling for gfx940. 28 warnings generated when compiling for gfx1030. 28 warnings generated when compiling for gfx1101. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx1102. 28 warnings generated when compiling for gfx900. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to 
silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | 
static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct 
ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for host. 28 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 
-DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 
'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused 
variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); 
return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | 
static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' 
[-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t 
collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { 
NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t 
xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static 
ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { 
NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t 
xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' 
[-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, 
int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, 
done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused 
function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t 
xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx1030. 45 warnings generated when compiling for gfx900. 45 warnings generated when compiling for gfx908. 45 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: 
unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 
'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t 
collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct 
ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float 
ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* 
node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* 
data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { 
NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | 
NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused 
variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, 
int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, 
done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused 
function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t 
xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, 
int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, 
done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused 
function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t 
xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, 
cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ >name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, 
data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' 
[-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct 
ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: 
unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { 
NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return 
ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static 
ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' 
[-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 
'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t 
xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int 
attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx803. 45 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 
'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t 
xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int 
attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx941. 45 warnings generated when compiling for gfx1102. 45 warnings generated when compiling for gfx906. 45 warnings generated when compiling for gfx940. 45 warnings generated when compiling for gfx90a. 45 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* 
collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: 
unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t 
collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused 
function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | 
^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static 
ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t 
collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | 
static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct 
ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | 
ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ 3 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' 
[-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 3 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- 
--offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF 
CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 
76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 
'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused 
function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. 4 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 
-DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives 
-I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | 
^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { 
| ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' 
[-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: 
warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: 
warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused 
function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | 
static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused 
function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; 
| ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 
1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long 
log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 
-DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' 
[-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { 
| ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 
-DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | 
static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 
-DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' 
[-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long 
log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 
--offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) 
{ | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return 
comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' 
[-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' 
[-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct 
ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return 
ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused 
function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' 
[-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 21 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: 
warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 
'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' 
[-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' 
[-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx1102. 21 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused 
function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** 
mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); 
return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' 
[-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, 
void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclRIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.ccs:u8l: tIn file included from 
_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.ht: 11c: oIn file included from l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.hl:N12e: tIn file included from C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.hl:o124s: eIn file included from C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.ho:l14l: (In file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.ht:r60u: cIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h :n14c: c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.hl:C40o:m13m:* warning: cunused function 'log2i' [-Wunused-function]o mm, void* collCom m40) | s{t aNtCiCcL ClHoEnCgK (lcoogm2mi-(>lnocncgl Cno)l l{N e t| - ^~~~~> closeColl(In file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cco:l9l: C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ho:m17m:)21):; warning: runused function 'collNetDevices' [-Wunused-function]e turn ncclSu c17c | esstsa;t i}c n| c ^~~~~~~~~~~~~~~~c lRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hs:u31l:t21_:t warning: cunused function 'collNetCloseListen' [-Wunused-function]o llNetDevices (31s | tsrtuactti cn cncclcCloRmems*u lcto_mtm ,c oilnltN*e tnCdleovs)e L{i sNtCeCnL(CsHtErCuKc(tc onmcmc-l>Cnocmcml*C oclolmNme,t -v>odiedv*i cleiss(tnedneCvo)m)m;) r{e tNuCrCnL CnHcEcClKS(uccocmems-s>;n c}c l C| o ^~~~~~~~~~~~~~l lNe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ht:-18>:c21l:o swarning: eunused function 'collNetGetProperties' [-Wunused-function]L isten(listenCo m18m | )s)t;a triect unrcnc lnRcecsluSlutc_cte scso;l l}N e t| G ^~~~~~~~~~~~~~~~~~e tP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hr:o33p:e12r:t iwarning: eunused function 'collNetSupport' 
[-Wunused-function]s (struct n c33c | lsCtoamtmi*c cionmtm , int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int 
size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' 
[-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, 
void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | 
^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return 
ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | 
^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* 
comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx906. 21 warnings generated when compiling for gfx803. 21 warnings generated when compiling for gfx900. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx940. 21 warnings generated when compiling for gfx941. 21 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: 
unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused 
function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t 
collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { 
NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { 
NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { 
NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx942. 
21 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 
40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | 
^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float 
ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 
229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' 
[-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx941. 5 warnings generated when compiling for gfx1100. 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx940. 5 warnings generated when compiling for gfx900. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx1102. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx803. 5 warnings generated when compiling for gfx1101. 5 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx942. 5 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- 
--offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: 
unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* 
subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: 
warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | 
^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, 
int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct 
ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 
101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const 
char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 
144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct 
ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused 
function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct 
ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: 
warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, 
const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 
'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t 
xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: 
unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* 
subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 
'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* 
value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx942. 17 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' 
[-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: 
warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 
'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 
'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx900. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' 
[-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t 
ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx940. 
11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx803. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx941. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t 
ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t 
ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when 
compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 
5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of 
member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, 
FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, 
ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: 
note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' 
requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: 7 warnings generated when compiling for gfx900. in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: 
field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for 
gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' 
requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 
| runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member 
function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation 
of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 
0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after 
field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 
| int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ ireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member 
function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, 
ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: 
note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | 
runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of 
member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' 
requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 
| warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, 
FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), 
| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 
| flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -c 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, 
tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 25 warnings generated when compiling for gfx90a. 25 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, 
FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908.
7 warnings generated when compiling for gfx1030.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), 
| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | In file included from warpInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cppl:o1c: kIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h10r: eIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hd:I169d: x./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hx:/509W:A29R:P _warning: Sfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]I ZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagT h507r | e a d ( (ttiidd(%t4i)d=)=,3 )n,t hgrreoaudps((gnrtohurpe)a,d s| ) ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~, w| i warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3d (tid%WA R510P | _ S I Z Es)t,e pwSairzpe((tnicdc/lWSAhRmPe_mS.IcZoEm)m,. 
b u| f ~~~~~~~~~~~~~~~~~~f S i| z stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)e s[NCC L508_ | P R O T Ow_aLrLp1I2n8B]l/oNcCkC(Lt_hSrTeEaPdSI/dsixz.exo/fW(AuRiPn_tS6I4Z_Et)),) {| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| warp(tid/WARP_SIZE| group(group 509 | flagThread((tid%4)==3)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 451g:r9o:u pnote: (in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested hereg roup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ 451 | | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 p r510i | m s ( t isdt,e pnStihzree(andcsc,l Sthrmeeem-.>cdoomwmn.,b utfrfeSei-z>edso[wNnC,C La_rPgRsO-T>Os_eLnLd1b2u8f]f/,N CaCrLg_sS-T>ErPeSc/vsbiuzfefo,f (aurignst-6>4r_etd)O)p A{r g )| ; ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:491e:e9S:p lnote: iin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested heret (args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, 
FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member 
function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, 
ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' 
requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested 
here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, 
ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: 
note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | 
runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' 
requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in 
instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 
0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation 
of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' 
requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 7 warnings generated when compiling for gfx908. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation 
of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5091 | : In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 10f: lIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hg:T169h: r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.he:a509d:(29(:t iwarning: dfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]% 4)==3), group(group) ,507 | | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ t| i warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3d (tid), n510t | h r e a dsst(enptShirzeea(dnsc)c,l Swhimde(mt.icdo%mWmA.RbPu_fSfISZiEz)e,s [wNaCrCpL(_tPiRdO/TWOA_RLPL_1S2I8Z]E/)N,C C L| _ ~~~~~~~~~~~~~~~~~~S T E| P stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)S /size o508f | ( u i n tw6a4r_ptI)n)B l{o c k| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa dIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9 :509 | note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here flagThr e533a | d ( ( t i d % 4 )p=r=i3m)s,( tgid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of 
member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ roup(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads)7 warnings generated when compiling for gfx1030. , wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ readsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cppd:/1W: AIn file included from R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:_10S: IIn file included from Z/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hE:)169,: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h :| 509 ~~~~~~~~~~~~~~~~~~: 29 :| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 508 | w507a | r p I n Btliodc(kt(itdh)r,e andtIhdrxe.axd/sW(AnRtPh_rSeIaZdEs)),, w| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d ( t| i warp(tid/WARP_SIZEd %WARP _509S | I Z E ) ,f lwaagrTph(rteiadd/(W(AtRiPd_%S4I)Z=E=)3,) , | g ~~~~~~~~~~~~~~~~~~r o u| p stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)( gr o508u | p ) , w| a ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~r p I| n warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3B lock(th r510e | a d I d xs.txe/pWSAiRzPe_(SnIcZcEl)S,h m e| m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. 
c o| m warp(tid/WARP_SIZEm .buffS i509z | e s [ N CfClLa_gPTRhOrTeOa_dL(L(1t2i8d]%/4N)CCL_ST==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit6(4a_rtg)s)) ;{ | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 533 : 9 : Rnote: uin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested heren WorkElemen t533< | F n , T , R epdrOipm,s (Atligdo-,n tPhrroetaod>s(S)p.lriutn,( wnet)h;r e a| d ^s -nthreadsSp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cppl:i11t:,1 :& tnote: rin instantiation of member function 'RunWork, 0, 1>::run' requested heree e->u p11, | ItMrPeLe_-C>OdLoLw_nF,U NaCr(gAsl-l>Rseednudcbeu,f fT,R EaEr,g sL-L>1r2e8c,v bMuifnf,, h a| l ^f ) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95 :994 | note: expanded from macro 'IMPL_COLL_FUNC' runTre e391S | p l iRtu,( atrygpse),; F u| n ^c ##devredo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:<202t:y53p:e >note: ,in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here NCCL_ A202L | G O _ # # a l g oR,u nNWCoCrLk_EPlReOmTeOn_t#<#Fpnr,o tTo,> (R)e.drOupn,( &Anlcgcol,S hPmreomt.ow>o(r)k.)r;u n\( w e| ) ^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 
0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | 
runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of 
member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but 
not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of 
member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, 
args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | 
prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hCL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 0, 1>::run' requested hereL L128]/NCCL_STE P202S | / s i z e o f ( uRiunntW6o4r_ktE)l)e m{e n t| < ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F n ,| group(groupT , RedOp, Algo, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o451t:o9>:( )note: .in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested herer 
un(we); 451| | ^ prims(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cppt:i6d:,1 :n tnote: hin instantiation of member function 'RunWork, 0, 1>::run' requested herer eads, 6t | rIeMeP-L>_dCoOwLnL,_ FtUrNeCe(-A>ldloRwend,u caer,g sT-R>EsEe,n dLbLu1f2f8,, aPrrgesM-u>lrSeucmv,b uufifn,t 8a_rtg)s - >| r^e dOpArg);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^95 : note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here391 | R u994n | W o r k d(oapr ,| ^N CCL_ALGO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:#202#:a53l:g onote: ,in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here NCCL_ P202R | O T O _ # # p r oRtuon>W(o)r.krEulne(m&ennctc().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 
0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: 
warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 
'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15R:u nwarning: Winitializer order does not 
match the declaration order [-Wreorder-ctor]o rkElementh(r)e.ardusn((nwteh)r;e a d| s ^) , tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cppc:k7(:t1h:r enote: ain instantiation of member function 'RunWork, 0, 2>::run' requested hered Idx.x) ,7 | gIrMoPuLp_(CgOrLoLu_pF)U,N C (| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l l R| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d uce, TR E563E | , S I MsPtLeEp,S iSzuem(PnosctcDliSvh,m eumi.ncto3m2m_.tb)u f f| S^i zes[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:P391R:O95T:O _note: Sexpanded from macro 'IMPL_COLL_FUNC'I MPLE]/NCC L391_ | S T ERPuSn/Wsoirzke, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree >, NCCL_ A324L | G O _ # # a lPgroi,m iNtCiCvLe_sPn(A)s.yrmumne(t&rniccc, /*Dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562t:=15*:/ 0note: ,field 'nthreads' will be initialized after field 'tidInBlock' Proto, 0> 562p | r i m s t i| d ^( tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h595r:e5a:d snote: (in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heren threa d595s | ) , t irduInnTBrleoecUkp(Dtohwrne >(ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562):;60 : | note: ^field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hered (tid) ,202 | n t h r e a d s (RnutnhWroerakdEsl)e,m etnitdp(()g.rrouunp()w,e ) ;| ^~~~~~~~~~~ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: 
note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rgs); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx940. 19 warnings generated when compiling for gfx941. 19 warnings generated when compiling for gfx90a. 19 warnings generated when compiling for gfx90a. 
19 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 19 warnings generated when compiling for gfx900. 19 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 19 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 19 warnings generated when compiling for gfx906. 19 warnings generated when compiling for gfx1100. 19 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for host. 19 warnings generated when compiling for gfx1101. 19 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 |
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 
2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]x .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid( t563i | d ) , nsttherpeSaidzse((nntchcrleSahdmse)m,. 
ctoimdmI.nbBulfofcSki(ztehsr[eNaCdCILd_xP.RxO)T,O _gSrIoMuPpL(Eg]r/oNuCpC)L,_ S T| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P S /| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i zeof(T )563) | { | s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t e p| S group(groupi ze(ncclShmem.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hb:u275f:f90S:i znote: ein instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres [NCCL_PR O275T | O _ S I M P LPEr]i/mNiCtCiLv_eSsT, /*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hD:i324r:e90c:t =note: *in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/ 0, Proto ,324 | 0 > p r i mPsr i m| i ^t ives, ProtoSimple<1, 1>>' requested here FanAs y595m | m e t r ircuP,rotoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clFunc##func, type, Func##devredop, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: 
warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' 
requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, 
/*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>(). 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | 
Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 
2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx906. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -c 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested 
here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15ou:p )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 562 | s t e ptSiidz(et(indc)c,l Snhtmherme.acdosm(mn.tbhurfefaSdisz)e,s [tNiCdCILn_BPlRoOcTkO(_tShIrMePaLdEI]d/xN.CxC)L,_ SgTrEoPuSp/(sgirzoeuopf)(,T ) )| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~{ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 563 | stepSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n324c:c90l:S hnote: min instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree m.comm. b324u | f f S i z e sP[rNiCmCiLt_iPvReOsT, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] CL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_MAX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation 
of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hOLL_FU:N562C:(15A:l lwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]e duce, TREE, SIMPLE, Min, int 65624 | _ t ) t| i^d (tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95):, note: texpanded from macro 'IMPL_COLL_FUNC'i dInBlock(thre a391d | I d xR.uxn)W,o rgkrn,c cNlCSChLm_eAmL.GcOo_m#m#.ablugfof,S iNzCeCsL[_NPCRCOLT_OP_R#O#TpOr_oStIoM>P(L)E.]r/uNnC(C&Ln_cScTlESPhSm/esmi.zweoorfk()T;) )\ | ^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i324d:(90t:i dnote: )in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, nthread s324( | n t h r e a dPsr)i,m ittiidvIensB ,note: field 'group' will be initialized after field 'stepSize'/ *Direct =562* | / 0 , Ptriodt(ot,i d0)>, pnrtihmrse a d| s ^( nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:)595,: 5t:i dnote: Iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heren Block( t595h | r e a d Irduxn.Txr)e,e UgprDoouwpn(>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 563562 | | sttiedp(Stiizde)(,n cnctlhSrhemaedms.(cnotmhmr.ebaudfsf)S,i zteisd[INnCBClLo_cPkR(OtThOr_eSaIdMIPdLxE.]x/)N,C CgLr_oSuTpE(PgSr/osuipz)e,o f (| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) ) | { tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 | | group(group stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:E275P:S90/:s inote: zin instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~275 | | group(group Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer ice,s c 5,: /note: *in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereD irect =595* | / 0 , PrruontTor,e e0U>p Dporwinm, ProtoSimple<1, 1>>' requested heree <1, 1 >595> | ( a r g sr)u;n T r| e ^e UpDown, 0, 2>::run' requested hereo Simple <2021 | , 1 > > ( a r gRsu)n;W o r| k ^E l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:m202e:n53t:< Fnote: nin instantiation of member function 'RunWorkElement, 0, 2>::run' requested here, T, R e202d | O p , A l g o ,R uPnrWootrok>E(l)e.run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ment().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- 
--offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uiIn file included from nt32_t /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppda:t1a: 1,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:l10a: gIn file included from 1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h,: 169d: at/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.ha2:,271 :f19l:a gwarning: 2unused variable 'ptr' [-Wunused-variable]; | ^~~~~ 271 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 
153 :u21i:n twarning: 6unused variable 'flag1' [-Wunused-variable]4 _t* ptr = r153e | c v P t ru(i0n)t+3l2l_1t2 8dOaftfas1e,t ;f l a| g ^~~1 , data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 
'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | 
Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 
1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member 
function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown<T, RedOp, ProtoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation 
of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:r562e:e15U:p Dwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]w ni>d((atrigds)),; n t| h ^r eads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s )note: ,in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here tidIn B202l | o c k ( t h r e aRduIndWxo.rxk)E,l egmreonutp<(Fgnr,o uTp,) ,R e d| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p , | A tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l go, Pro t563o | > ( ) . rsutne(pwSei)z;e ( n| c ^c lShmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppm:m12.:b1u:f fnote: Sin instantiation of member function 'RunWork, 0, 2>::run' requested herei zes[N C12C | LI_MPPRLO_TCOO_LSLI_MFPULNEC](/ANlClCRLe_dSuTcEeP,S /TsRiEzEe,o fS(ITM)P)L E{, P| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o d ,| group(groupd ouble) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :expanded from macro 'IMPL_COLL_FUNC'324 :90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | Ru n324W | o r k < n c cPlrFiumnict#i#vfeusnC,C LN_CMCALX__ADLEGVO__A#R#IaTlYg>o,, /N*CDCiLr_ePcRtO=T*O/_0#,# pPrroottoo>,( )0.>r upnr(i&mnsc c l| S ^h mem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:k595):;5 :\ note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here| ^ 595 | 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | rtuindT(rteiedU)p,D onwtnh(>t(harregasd)I;d x .| x ^) , group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:)53,: note: | in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 60 : note: field 'group' will be initialized after field 'stepSize' RunWo r562k | E l e m etnitd<(Ftni,d )T,, nRtehdrOepa,d sA(lngtoh,r ePardost)o,> (t)i.drIunnB(lwoec)k;( t h| r ^e adIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp,: 4g:r1o:u pnote: (in instantiation of member function 'RunWork, 0, 2>::run' requested hereg roup )4, | I M| P ^~~~~~~~~~~L _COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here < 595 | runTNrCeCeLU_pMDAoXw_nDr,o t/o*SDiimrpelcet<=1*,/ 01,> >P(raortgos,) ;0 > | p ^r ims | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hW:o595r:k5E:l enote: min instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heree nt<(T),. 
rRuend(Owpe,) ;P r o| t ^o Simple<1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp,: 51:>1>:( anote: rin instantiation of member function 'RunWork, 0, 2>::run' requested hereg s); | 5 ^ | IMPL_COLL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hF:U202N:C53(:A lnote: lin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereR educe, T202R | E E , S I M P LREu,n WPorrokdE,l eumienntt8<_Ftn), T| ,^ RedOp, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA:l391g:o95,: Pnote: rexpanded from macro 'IMPL_COLL_FUNC'o to>().run (391w | e ) ;R u n| W ^o rk, 0, 2>::run' requested herec , ty p5e | ,I MFPuLn_cC#O#LdLe_vFrUeNdCo(pAu,c eN,C CTLR_EAEL,G OS_I#M#PaLlEg,o ,P rNoCdC,L _uPiRnOtT8O__t#)# p r| o^t o>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(391&:n95c:c lnote: Sexpanded from macro 'IMPL_COLL_FUNC'h mem.work) ;391 | \ R| u ^n Workl,o cNkC(CtLh_rAeLaGdOI_d#x#.axl)g,o ,g rNoCuCpL(_gPrRoOuTpO)_,# # p| r ^~~~~~~~~~~~~~~~~o to>().run(&nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:h562m:e60m:. 
wnote: ofield 'group' will be initialized after field 'stepSize'r k); \ 562| | ^ tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock') , tidInBlock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~a ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 
2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads),) tidInBl o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 324 | P562r | i m i t itvieds(.,x )/,* group(Dgirroeucpt)=,* / 0| , ^~~~~~~~~~~ Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL _562A | L G O _ #t#iadl(gtoi,d )N,C CnLt_hPrReOaTdOs_(#n#tphrroetaod>s()),. rtuind(I&nnBclcolcSkh(mtehmr.ewaodrIkd)x;. x\) , | g ^r oup(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,15 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~field 'nthreads' will be initialized after field 'tidInBlock' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562563 | | tsitde(ptSiidz)e,( nnctchlrSehamdesm(.nctohmrme.abdusf)f,S itziedsI[nNBClCoLc_kP(RtOhTrOe_aSdIIMdPxL.Ex])/,N CgCrLo_uSpT(EgPrSo/uspi)z,e o f| ( ^~~~~~~~~~~~~~~~~T )) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | field 'group' will be initialized after field 'stepSize' group(group 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d275s:(90n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads), t i275d | I n B l o c kP(rtihmrietaidvIedsx<.Tx,) ,R egdrOopu,p (FgarnoAuspy)m,m e t| r ^~~~~~~~~~~i c, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgs); | ^ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 562 | R u n Wtoirdk(Etliedm)e,n tn(s)).,r utni(dwIen)B;l o c| k ^( threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.:x7):,1 :g rnote: oin instantiation of member function 'RunWork, 0, 2>::run' requested hereu p(gro u7p | )I,M P L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C O L| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ FUNC(Al l563R | e d u c es,t eTpRSEiEz,e (SnIcMcPlLSEh,m ePmr.ocdo,m mu.ibnutf3f2S_itz)e s [| N^C 
CL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:I391M:P95L:E ]note: /expanded from macro 'IMPL_COLL_FUNC'N CCL_STEP S391/ | s i zReuonfW(oTr)k)< n{c c l| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F u n| c group(group# #func, type, Func##devredop, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:C324L:_90A:L Gnote: Oin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ ##algo, N324C | C L _ P R O TPOr_i#m#iptriovteos><(T),. rRuend(O&pn,c cFlaSnhAmseymm.mweotrrki)c;< 1\, N| C ^C L_MAX_DEV_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:I562T:Y15>:, note: /field 'nthreads' will be initialized after field 'tidInBlock'* Direct=*/ 0562, | P r o ttoi,d (0t>i dp)r,i mnst h r| e ^a ds(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d595s:)5,: tnote: iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hered InBlo c595k | ( t h r eraudnITdrxe.exU)p,D ogwrno>(arg s562) | ; | ^t id(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s (note: nin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heret hread s202) | , t i d I n B lRunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation 
of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, 
FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: 
note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::15562:: 15warning: :initializer order does not match the declaration order [-Wreorder-ctor] warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s563t | e p S i 
zset(enpcScilzSeh(mnecmc.lcSohmmme.mb.ucfofmSmi.zbeusf[fNSCiCzLe_sP[RNOCTCOL__SPIRMOPTLOE_]S/INMCPCLLE_]S/TNECPCSL/_sSiTzEePoSf/(sTi)z)e o{f ( T| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) {| group(group | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 
2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~~~~~~~15 : warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hinitializer order does not match the declaration order [-Wreorder-ctor]: 562:60: note: field 'group' will be initialized after field 'stepSize' 562 | t i562d | ( t i d )t, indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr), group(group), | ^~~~~~~~~~~e adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, 
rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' 
requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' 
requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip 
--offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 
'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: 
warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member 
function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 7 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when 
compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | 
Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 
--offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated 
when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: 
warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 
'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in 
instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but 
not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member 
function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | 
Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation 
of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_## a562l | g o , NtCiCdL(_tPiRdO)T,O _n#t#hprreoatdos>((n)t.hrruena(d&sn)c,c ltSihdmIenmB.lwoocrkk()t;h r\e a d| I ^d x.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:( gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(ti 563d | ) , n tshtreepaSdisz(en(tnhcrcelaSdhsm)e,m .tciodmImn.BbluofcfkS(itzhesr[eNaCdCILd_xP.RxO)T,O _gSrIoMuPpL(Eg]r/oNuCpC)L,_ S T| E ^~~~~~~~~~~~~~~~~P S/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:o60f:( T)note: )field 'group' will be initialized after field 'stepSize' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562| | group(group tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a275d:s90(:n tnote: hin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads), t275i | d I n B l o 
cPkri(mtihtrievaedsI, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562595::155:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUp D562o | w n < T ,t iRde(dtOipd,) ,P rnotthorSeiamdpsl(end>s()a,r gtsi)d;I n B| l ^o ck(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x202.:x53):, note: gin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herer oup(g r202o | u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)W orkEle m563e | n t < F ns,t eTp,S iRzeed(Onpc,c lASlhgmoe,m .Pcroomtmo.>b(u)f.frSuinz(ewse[)N;C C L| _ ^P ROTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppP:L9E:]1/:N Cnote: Cin instantiation of member function 'RunWork, 0, 2>::run' requested hereL _STE P9S | /IsMiPzLe_oCfO(LTL)_)F U{N C (| A ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l l R| e group(groupd uce, TREE, SIMPLE, PreM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:l275S:u90m:, note: uin instantiation of member 
function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei nt64_t) 275 | | ^ Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:m391it:i95v:e snote: , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, ProtoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ymmetric, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, 
/*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, 
FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | 
Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 
--offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 
'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: 
in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: 
in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: 
in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 
'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 
'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 
'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not 
used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: 
variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: 
unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused 
variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' 
set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation 
of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCC L562_ | A L G O _t#i#da(ltgiod,) ,N CnCtLh_rPeRaOdTsO(_n#t#hprreoatdos>)(,) .triudnI(n&BnlcocclkS(htmherme.awdoIrdkx).;x )\, g| r ^o up(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,note: field 'nthreads' will be initialized after field 'tidInBlock' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | t563i | d ( t i ds)t,e pnStihzree(andcsc(lnSthhmreema.dcso)m,m .tbiudfIfnSBilzoecsk[(NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo]u/pN(CgCrLo_uSpT)E,P S /| s ^~~~~~~~~~~~~~~~~i zeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:(562T:)60): {note: field 'group' will be initialized after field 'stepSize' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid), 
nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,68 :t56i:d Inote: nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereB lock(t h68r | e a d I dPxr.ixm)i,t igvreosu, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested 
here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 |
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 
17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1030. 
17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation 
of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 
0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: 
note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~15 : | warning: group(groupinitializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t68i:d56(:t inote: din instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here) , nth r68e | a d s ( nPtrhirmeiatdisv)e,s g,r o0u,p (Pgrrootuop,) ,0 > | p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r i m| s tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 588 :s5t:e pnote: Sin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested herei ze(n c588c | l S h m ermu.ncRoimnmg.R(OaTrOg_sS)I;M P L| E ^] /NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/:s202i:z53e:o fnote: (in instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereT )) { 202| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hR:u68n:W56o:r knote: Ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herel ement <68F | n , T ,P rRiemdiOtpi,v eAsl (F)a.nrSuynm(mweet)r;i c <| 1 ^> , 0, Proto, 0/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp>: 10p:r1i:m snote: in instantiation of member function 'RunWork, 1, 2>::run' requested here | ^ 10 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hM:P588L:_5C:O Lnote: Lin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here_ FUNC (588A | l l R e druucneR,i nRgI,( ahraglsf)); | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202391::5395:: note: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereexpanded from macro 'IMPL_COLL_FUNC' 202 | 391 | R uRnuWnoWrokro(p)<.tryupne(>w,e )N;C C L_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set 
but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork(,t iNdC)C,L _nAtLhGrOe_a#d#sa(lngtoh,r eNaCdCsL)_,P RtOiTdOI_n#B#lporcokt(ot>h(r)e.arduInd(x&.nxc)c,l Sghrmoeump.(wgorroku)p;) ,\ | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h563: | 562 : 15 : snote: tfield 'nthreads' will be initialized after field 'tidInBlock'e pSize(ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 68t:i56d:( tnote: iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hered ), nt h68r | e a d s (Pnrtihmrietaidvse)s,< Tt,i dRIendBOlpo,c kF(atnhSryemamdeItdrxi.cx<)1,> ,g r0o,u pP(rgortoou,p )0,> p| r ^~~~~~~~~~~i ms | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 
17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true 
-mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation 
of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE,
PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | 
Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' 
requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not 
used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 
271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: 
warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: 
unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uintIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not 
used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: 
unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:ds562(:n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds), tidInBlock(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s), tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s [ N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_PROTO _563S | I M P L Es]t/eNpCSCiLz_eS(TnEcPcSl/Sshimzeemo.fc(oTm)m). 
b{u f f| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i z e| s group(group[ NCCL_PROTO_SIMPLE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_626S:T9E:P Snote: /in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres izeof(T) )626 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupp rims(tid-tidStartScatter/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626n:T9h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered sScatter ,626 | N U L L , d i rpercitm-s>(utpi,d -atrigdsS-t>asretnSdcbautftfe,r ,a rngTsh-r>eraedcsvSbcuaftft,e r ,| ^N ULL, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202c:t53-:> unote: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, args -202> | s e n d b u f f ,R uanrWgosr-k>Erleecmvebnutf, 2, 2>::run' requested here> ().ru n202( | w e ) ; | ^ RunWorkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppe:m5e:n1t:< Fnote: nin instantiation of member function 'RunWork, 2, 2>::run' requested here, T, Re d5O | pI,M PALl_gCoO,L LP_rFoUtNoC>((A)l.lrRuend(uwcee),; C O| L ^L NET_DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp :S6I:M1P:L Enote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here SumPos t6D | iIvM,P Lu_iCnOtL8L__tF)U N C| (^A llReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:,391 :C95O:L Lnote: Nexpanded from macro 'IMPL_COLL_FUNC'E T_DIRECT, 391S | I M PRLuEn,W oSrukm, NC C391L | _ A LRGuOn_W#o#raklF(u)n.cr#u#nd(e&vnrcecdloSpho,r 
kN)C;C L\_ A L| G ^O _##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:,562 :N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'P ROTO_## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock'I dx.x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~n thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60(:n tnote: hfield 'group' will be initialized after field 'stepSize'r eads), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~l ock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r60e:a dnote: Ifield 'group' will be initialized after field 'stepSize'd x.x), g562roup(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, 
direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 
note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 15 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): warning: initializer order does not match the declaration order [-Wreorder-ctor] 563 | stepSize(ncclS h562m | e m . c otmimd.(btuifdf)S,i znetsh[rNeCaCdLs_(PnRtOhTrOe_aSdIsM)P,L Et]i/dNICnCBLl_oScTkE(PS/sizeothreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork:,15 :N Cwarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]L _ALGO_##algo, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. 
wtoirdkI)n;B l\o c | k ^( thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,note: field 'nthreads' will be initialized after field 'tidInBlock'g roup(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , nthre a563d | s ( n t hsrteeapdSsi)z,e (tnicdcIlnSBhlmoecmk.(ctohmrme.abduIfdfxS.ixz)e,s [gNrCoCuLp_(PgRrOoTuOp_)S,I M P| L ^~~~~~~~~~~~~~~~~E ]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60S:T Enote: Pfield 'group' will be initialized after field 'stepSize'S /sizeo f562( | T ) ) {t i d| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ) group(group, nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a666d:s9):, note: tin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei dInBloc k666( | t h r e a d I d xp.rxi)m,s (gtriodu,p (ngTrhoruepa)d,s G a| t ^~~~~~~~~~~h er, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dwarning: 
Iinitializer order does not match the declaration order [-Wreorder-ctor]n Block(threadIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~a ds(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r60e:a dnote: sfield 'group' will be initialized after field 'stepSize') , tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B lock(th r563e | a d I d xs.txe)p,S igzreo(unpc(cglrSohumpe)m,. c o| m ^~~~~~~~~~~m .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::666202::953:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 666 | 202 | p r i m s ( tRiudn,W onrTkhErleeamdesnGta uApl,g oN,U LPLr,o taor>g(s)-.>rsuenn(dwbeu)f;f , | a ^r gs->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWork, 2, 2>::run' requested here: 202:53: note: 4in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | IMPL_ C202O | L L _ F U N C ( ARlulnRWeodrukcEel,e mCeOnLtLt(D)i.vr,u ni(nwte8)_;t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::1391:: 95note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: expanded from 
macro 'IMPL_COLL_FUNC' 6 | IMPL _391C | O L LR_uFnUWNoCr(kAi,v ,N CiCnLt_3A2L_GtO)_ # #| a^l go, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L391_:PR95O:T Onote: _expanded from macro 'IMPL_COLL_FUNC'# #proto>( )391. | r u nR(u&nnWcocrlkS, NCCL _562A | L G O _ #t#iadl(gtoi,d )N,C CnLt_hPrReOaTdOs_(#n#tphrroetaod>s()),. rtuind(I&nnBclcolcSkh(mtehmr.ewaodrIkd)x;. x\) , | g ^r oup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)15,: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 60 : note: tfield 'group' will be initialized after field 'stepSize'i d(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~o up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 
2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 
2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, 
direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ( g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up), | 563 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) stepSi z563e | ( n c c lsSthempeSmi.zceo(mnmc.cbluSfhfmSeimz.ecso[mNmC.CbLu_fPfRS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hOi:Tz562Oe:_s15S[:IN MCwarning: PCinitializer order does not match the declaration order [-Wreorder-ctor]LL E_]P/RNO CT562CO | L_ _S SI TM EPtPLiSEd/](s/tiNizCdeC)oL,f_ (SnTTt)Eh)Pr Se{/a sd is| z( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~en ot fh| (r group(groupTe )a)d s{) , | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d I| n group(groupB 
lock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a666d:I9d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:x: .666note: x:in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here)9 ,: gnote: r in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo666 u | p ( g666 r | o u p ) , p r i| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ps r( it| mi tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)sd (,t i nd563T, | h rn eT ah drssetGaeadptsShGieazrte,h( endrci,cr ledScihtrm-ee>cmut.p-c,>o umNpmU,.L bLNuffSizes[UNLC,LC ,La _raPgrRsgO-sT>-Os>_esSneIdnMbdPubLfuEff],f/ ,Na CraCgrLsg_-sS>-Tr>EerPceSvc/bvsubifuzffe,fo ,f (| T ^| ) ^) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h 202: :202| 53: group(group:53 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 626 :202 9 | 202: | note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | R uR nu Wn oW ro kr Ek lEpelrmeiemmnestn(oa(>d)(s.)Sr.curanut(ntw(eewr)e,;) ;N U| L ^| L ^, direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:-:6>8:u:1p1:,: note: 
anote: in instantiation of member function 'RunWork, 2, 2>::run' requested hererin instantiation of member function 'RunWork, 2, 2>::run' requested here g s- > 6s8 | e | InIMdMPbPLuL_f_CfCO,OL LLaL_r_FgFUsUN-NC>C(r(AeAlcllvlRbReuedfdufuc,ce e, , | C ^CO OLLLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hNN:EE202TT:__53DD:II RRnote: EEin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereCC TT, , 202 S | SI IM MP PL LE E, , S SuRumumPnPoWosostrtDkDiEivlv,e, m ieinnnttt3<62F4_n_t,t) ) T , | | ^R^ e dOp, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: | 391391 :: 9595 :: snote: note: texpanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC'e pSize(nc c391391l | | S h mRReuumnn.WWcooorrmkkm<<.nnbccuccfllfFFSuuinnzcce##s##[ffNuuCnnCccL,,_ PttRyyOppTeeO,,_ SFFIuuMnnPccL##E##]dd/eeNvvCrrCeeLdd_ooSppT<>i,,z eNNoCCfCC(LLT__)AA)LL GG{OO __ ##| ## ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa ll gg| oo group(group,, NNCCCCLL__P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hPR:RO666OT:TO9O_:_# ##note: #pin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herepr roott oo666>> | (( )) .. 
rr uu nn (( && nnpccrccillmSSshh(mmteeimmd..,ww oonrrTkkh))r;;e a\\d s G| | a ^ ^t her, direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht::-562562>::u1515p::, note: note: Nfield 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock'U LL, 562a | r562 g | s - > ts ietdni(ddtb(iutdfi)fd,,) ,na trnhgtrshe-ra>edrased(csnv(tbnhutrfhefra,ed as d)| s, ^) ,t it/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hdi:Id202nI:Bn53lB:ol conote: kcin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here(k t(ht rh202er | ae da Id dI xd .x x. )x ,)R ,ug nrgWorououprp(k(gEgrlroeoumupep)n),t, < F | n| ^~~~~~~~~~~~~~~~~, ^~~~~~~~~~~~~~~~~ T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562:R:562e60:d:60O :pnote: ,field 'group' will be initialized after field 'stepSize'note: field 'group' will be initialized after field 'stepSize'A l g562o | , 562P | r ot ti od >(t(ti)id.d(r)tu,in d(n)wt,eh )rn;et ah dr| se ^(a ndtsh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp(r:ne8ta:hd1rs:e) a,note: d in instantiation of member function 'RunWork, 2, 2>::run' requested herest )i,d I8tn | iBIdlMIoPncLBk_l(CotOchLkrL(e_taFhdUrINedCax(d.AIxld)lx,R. 
exgd)ru,oc uegp,r( ogCurOpoL(uLgpNr)Eo,Tu _p D)| I, ^~~~~~~~~~~R E C| T ^~~~~~~~~~~, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT,ET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx908. 43 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 43 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 
2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, 
uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 43 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, 
nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for host. 43 warnings generated when compiling for gfx900. 43 warnings generated when compiling for gfx906. 43 warnings generated when compiling for gfx1100. 43 warnings generated when compiling for gfx1101. 43 warnings generated when compiling for gfx1102. 43 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of 
member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of 
member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, 
args->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f15,: awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->recvbuff, | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:(53t:i dnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, nth r202e | a d s ( n t h r eRaudnsW)o,r ktEildeImneBnltou(p)).,r u n| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~w e )| ; tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :s6t:e1p:S inote: zin instantiation of member function 'RunWork, 2, 2>::run' requested heree (nccl S6h | mIeMmP.Lc_oCmOmL.Lb_uFfUfNSCi(zAelsl[RNeCdCuLc_eP,R OCTOOL_LSNIEMTP_LDEI]R/ENCCTC,L _SSITMEPPLSE/,s iPzreoodf,( Ti)n)t 3{2 _ t| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | Ru n666W | o r k < n c c l Fpurnicm#s#(ftuindc,, ntTyhpree,a dFsuGnact#h#edre,v rdeidroepcpuep>,, NNUCLCLL,_ AaLrGgOs_-#>#saelngdob,u fNfC,C La_rPgRsO-T>Or_e#c#vpbruoftfo,> ( )| . 
^r un(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:l202S:h53m:e mnote: .in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herew ork); 202\ | | ^ R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562W:o15r:k Enote: lfield 'nthreads' will be initialized after field 'tidInBlock'e ment((n)t.hrruena(dwse)),; t i| d ^I nBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:h4r:e1a:d Inote: din instantiation of member function 'RunWork, 2, 2>::run' requested herex .x), g4r | oIuMpP(Lg_rCoOuLpL)_,F U N| C ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIREC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, 
nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, 
int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStart/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hReduce:,562 :n15T:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a dsReduce, direct->down ,562 | & d i r etcitd-(>toiudt),, anrtghsr-e>asdesn(dnbtuhfrfe,a dasr)g,s -t>irdeIcnvBbluofcfk,( t h| r ^e adIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | R usntWeoprSkiEzlee(mnecnctlP(R)O.TrOu_nS(IwMeP)L;E ] /| N ^C CL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpps:i5z:e1o:f (note: Tin instantiation of member function 'RunWork, 2, 2>::run' requested here) ) { | 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | I M| P group(groupL _COLL_FUNC(AllReduce, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:L626L:N9E:T _note: Din instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI RECT, S I626M | P L E , P r o dp,r iumisn(tt8i_dt-)t i d| S^t artSc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:t391t:e95r:, note: nexpanded from macro 'IMPL_COLL_FUNC'T hreadsSc a391t | t e rR,u nNWUoLrLk,< ndcicrleFcutn-c>#u#pf,u nacr,g st-y>psee,n dFbuunfcf#,# daervgrse-d>orpef,, N C| C ^L 
_ALGO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:g202o:,53 :N Cnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL _PROT O202_ | # # p r o t o > (R)u.nrWuonr(k&EnlcecmleSnhtm:(15):. rnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n (we); | 562 ^ | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:i5d:)1,: nnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh reads (5n | tIhMrPeLa_dCsO)L,L _tFiUdNICn(BAllolcRke(dtuhcree,a dCIOdLxL.NxE)T,_ DgIrRoEuCpT(,g rSoIuMpP)L,E , | P ^~~~~~~~~~~~~~~~~r od,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:n60t:8 _note: tfield 'group' will be initialized after field 'stepSize') | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391t:i95d:( tnote: iexpanded from macro 'IMPL_COLL_FUNC'd ), nthre a391d | s ( nRtuhnrWeoardks<)n,c ctliFduInncB#l#ofcukn(ct,h rteyapdeI,d xF.uxn)c,# #gdreovurpe(dgorpo ,| ^~~~~~~~~~~N CCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, Proto>().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation
of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, 
direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.htepSize:(202n:c53c:l Snote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem em.comm.buffSizes [202N | C C L _ P R O T OR_uSnIWMorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :a562r:g15s:- >warning: sinitializer order does not match the declaration order [-Wreorder-ctor]e ndbuff, args->recvbuf f562, | | ^ tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s(nthr e202a | d s ) , t i d IRnuBnlWoocrkk(EtlhermeeandtI | ( tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) .run(w e 563)562 | ; | | ^ s ttied/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppp(:St4ii:zd1e):(, n note: cnin instantiation of member function 'RunWork, 2, 2>::run' requested herect lhS rh4em | aeIdmMs.P(cLno_tmChmOr.LebLau_dfFsfU)SN,iC z(teAisld[lINRnCeBCdlLuo_ccPekR,(O tTChOOr_LeSLaINdMEIPTdL_xED.]Ix/R)NE,CC CTgL,r_ oSSuTIpEM(PPgSLr/Eos,ui pzP)er,oo fd (,| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~)i )n t| {8 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) _ t| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 | | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:t391e:p95S:i znote: eexpanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h( :n626c: c9391l: | S hnote: min instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereRe umn.Wc oo626rm | km <. nb cu cf lf FS ui nzpcer#si#[mfNsuC(nCtcLi,_d P-tRtyOipTdeOS,_t SaFIruMtnPScLc#Ea#]td/teNevCrrC,eL d_noSTpThiS,zc eaNotCftC(eLTr_),A) L NG{UO L_ L#| ,# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ a dl ig| ro group(groupe, c tN-C>CuLp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_,:P 641Ra:Or11Tg:Os _-note: #>in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here#s perno dt641bo | u> f( f) ,. 
r au rn g( s& -n >cprcrelicSmvhsbm(uetfmif.d,w- ot ri| kd ^)S ;t a\r /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht :R| 202e ^:d 53u:c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h e:note: ,562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here : n15T: h 202rnote: | efield 'nthreads' will be initialized after field 'tidInBlock' a d s R562 e | d u c Re u,tn iWddoi(rrtkeiEcdlt)e-,m> ednnottwhapod,us t)A,,l gatori,gd sIP-nr>Boslteoonc>dk(b()ut.fhrfru,en a(adwrIegd)sx;-. >x r)| e, ^c vgbruof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppuf:p,6( :g 1r| :o ^ u note: pin instantiation of member function 'RunWork, 2, 2>::run' requested here) ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 2026| : | ^~~~~~~~~~~~~~~~~53I :M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h P:note: L562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_: C60O :L202 L_F | U N C ( A l l R eRduuncWeo,r kCEOlLeLmNeEnTt_2(_)t.)r u n| (^w e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 391 | R u5n | WIoMrPkL<_nCcOcLlLF_uFnUcN#C#(fAulnlcR,e dtuycpee,, CFOuLnLcN#E#Td_eDvIrReEdCoTp,< tSyIpMeP>L,E ,N CPCrLo_dA,L GuOi_n#t#8a_ltg)o , | N^C CL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_391#:#95p:r otnote: oexpanded from macro 'IMPL_COLL_FUNC'> ().run(& n391c | c l SRhumneWmo.rwko(,t iNdC)C,L _nAtLhGrOe_a#d#sa(lngtoh,r eNaCdCsL)_,P 
RtOiTdOI_n#B#lporcokt(ot>h(r)e.arduInd(x&.nxc)c,l Sghrmoeump.(wgorroku)p;) ,\ | | ^~~~~~~~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h60::562 :note: 15field 'group' will be initialized after field 'stepSize': note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:)15,: nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads(nthreads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkd,e vNrCeCdLo_pA#,a lNgCoC,L _NACLCGLO__P#R#OaTlOg_o#,# 
pNrCoCtLo_>P(R)O.TrOu_n#(#&pnrcoctloS>h(m)e.mr.uwn(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, 
NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of 
member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
roto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), |
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkEle>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ment().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h), :| 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 15 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 563 | st e562p | S i z e (tnicdc(ltSihdm)e,m .nctohmrme.abdusf(fnStihzreesa[dNsC)C,L _tPiRdOITnOB_lSoIcMkP(LtEh]r/eNaCdCILd_xS.TxE)P,S /gsriozuepo(fg(rTo)u)p ){, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | group(group tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:l677S:h11m:e mnote: .in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec omm.buffS i677z | e s [ N C C L _ P R OpTrOi_mSsI(MtPiLdE-]t/iNdCSCtLa_rStTBEcPaSs/ts,i zneTohfr(eTa)d)s B{c a s| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, &| d group(groupi rect->out, direct->down, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:g626s:-9>:s enote: nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered buff, ar g626s | - > r e c v b u fpfr,i m s| ( ^t id-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:r202t:S53c:a tnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree r, nT h202r | e a d s S c a t tReurn,W oNrUkLELl,e mdeinrte uTp,, RaerdgOsp-,> sAelngdob,u fPfr,o taor>g(s)-.>rruenc(vwbeu)f;f , | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 5 | IMP L202_ | CO L L _ F U N C (RAulnlWRoerdkuEclee,m 
eCnOtL (u)i.nrtu8n_(tw)e ) ;| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:: 6note: :expanded from macro 'IMPL_COLL_FUNC'1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 391 | Ru6n | WIoMrPkL<_nCcOcLlLF_uFnUcN#C#(fAulnlcR,e dtuycpee,, CFOuLnLcN#E#Td_eDvIrReEdCoTp,< tSyIpMeP>L,E ,N CPCrLo_dA,L GiOn_t#3#2a_ltg)o , | N^C CL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:#391#:p95r:o tonote: >expanded from macro 'IMPL_COLL_FUNC'( ).run(&nc c391l | S h mReumn.Wwoorrkk<)n;c c\l F u| n ^c ##func,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562y:p15e:, note: Ffield 'nthreads' will be initialized after field 'tidInBlock'u nc##dev r562e | d o p < ttyipde(>t,i dN)C,C Ln_tAhLrGeOa_d#s#(anltghor,e aNdCsC)L,_ PtRiOdTIOn_B#l#opcrko(ttoh>r(e)adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork(,t iNdC)C,L _nAtLhGrOe_a#d#sa(lngtoh,r eNaCdCsL)_,P R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.htO:iT562dO:I_15n#:B# lpwarning: orinitializer order does not match the declaration order [-Wreorder-ctor]co kt(ot>h( r)562e. | ar du In d( x&t.nixcd)c(,lt Sighdrm)oe,um p.n(wtgohrrrokeu)ap;d) s,\( n t| | h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^r e a| d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s: )562,: 15t :i563 d | note: I field 'nthreads' will be initialized after field 'tidInBlock'n B l o sc562tk | e( pt Sh ir zeteai(ddnI(cdtcxil.dSx)h),m, e nmgt.rhcorouempam(d.gsbr(uonfutfphS)ri,ez ae ds| s[ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~)N ,C C| tL tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i_ dPROT O563_ | IS nI BM lP oLscEtk]e(/ptNShCirCzeLea_(dSnITcdEcxPl.SSx/h)sm,ie zmge.rocofou(mpTm().g)br uo{fu fp S)| i, ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e s| | [ ^~~~~~~~~~~~~~~~~ group(groupN C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P60R:O Tnote: Ofield 'group' will be initialized after field 'stepSize'_ SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP :L562666E | :] 9/ :N C note: Ctin instantiation of member 
function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereLi _dS(TtE iP666dS | )/ ,s i nz te ho rf e( aTpd)rs)i( mn{st (h tr| ie ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~da ,d s| n) group(groupT, h rteiad/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hdI:sn687GB:al11to:hc eknote: r(in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here,t hdri er687ae | dc It d- x> .u xp ), , N gU rLpoLru,ip m(asgr(rgtosiu-dp>-)st,ei nd dS| bt ^~~~~~~~~~~ua frft,B caarsgts,- >nrTehcrvebaudfsfB,c a s| t ^, &dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:t202-:>53o:u tnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here nullp t202r | , a r g s - > sReunndWbourfkfE,l eamregnst-<>Frne,c vTb,u fRfe,d O p| , ^ Algo, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o202t:o53>:( )note: .in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer un(we )202; | | ^ RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppo:r8k:E1l:e mnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested heren td(u)c.er,u nC(OwLeL)N;E T _| D ^I RECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppS:I5M:P1L:E ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereP rod, i5n | tI6M4P_Lt_)C O L| L^_ FUNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA:l391l:R95e:d unote: cexpanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of 
member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 563 | stepSize(ncclS h562m | e m tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hffSiz:e562s:[15N:C Cwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]_ PROTO_SIMPLE]/NCCL_STEPS/ s562i | z e o f (tTi)d)( t{ i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h.:x626):,9 :g rnote: oin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested hereu p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~626 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) p r563i | m s ( t isdt-etpiSdiSztea(rntcScclaSthtmeerm,. cnoTmhmr.ebaudfsfSSciaztetse[rN,C CNLU_LPLR,O TdOi_rSeIcMtP-L>Eu]p/,N CaCrLg_sS-T>EsPeSn/dsbiuzfefo,f (aTr)g)s -{> r e| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v b u| f group(groupf , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 202note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here R u641n | W o r k E l e m e n tpT(h)r.eraudns(Rweed)u;c e ,| ^d irect->dow/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppn:,7 :&1d:i rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herec t->ou t7, | IaMrPgLs_-C>OsLeLn_dFbUuNfCf(,A lalrRgesd-u>cree,c vCbOuLfLfN,E T _| D ^I RECT, SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:L202E:,53 :P rnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered , uint 32022 | _ t ) | ^ RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:l391e:m95e:n tnote: l(Fu)n.cr#u#nf(uwnec),; t y| p ^e , Func##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:e6v:r1e:d onote: pin instantiation of member function 'RunWork, 2, 2>::run' requested here< type> ,6 | NICMCPLL__ACLOGLOL__F#U#NaCl(gAol,l RNeCdCuLc_eP,R OCTOOL_L#N#EpTr_oDtIoR>E(C)T.,r 
uSnI(M&PncLcEl,S hmPermo.dw,o rikn)t;3 2\_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562391::1595:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtki),, N C| C ^~~~~~~~~~~~~~~~~L _ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#60a:l gnote: ofield 'group' will be initialized after field 'stepSize', NCCL_P R562O | T O _ # #tpirdo(ttoi>d()),. rnutnh(r&enacdcsl(Snhtmherme.awdosr)k,) ;t i\d I n| B ^l ock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:I15d:x .note: xfield 'nthreads' will be initialized after field 'tidInBlock') , group (562g | r o u p )t,i d (| t ^~~~~~~~~~~i d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562626::159:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 626 | 562 | p r itmisd((ttiidd-)t,i dnSttharretaSdcsa(tnttehrr,e andTsh)r,e atdisdSIcnaBtltoecrk,( tNhUrLeLa,d Iddixr.exc)t,- >gurpo,u pa(rggrso-u>ps)e,n d b| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f f ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rgs->re c563v | b u f f ,s t e| p ^S ize(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m202e:m53.:c onote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem .buff S202i | z e s [ N C C L _RPuRnOWToOr_kSEIlMePmLeEn]t/ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( ) .| r group(groupu n(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp::117:: 1note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6877 | | I M P L _ C O L L _pFrUiNmCs((AtlildR-etdiudcSet,a rCtOBLcLaNsEtT,_ DnITRhErCeTa,d sSBIcMaPsLtE,, &Pdriorde,c tu-i>notu3t2,_ tn)u l l| p^t r, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:-391>:s95e:n dnote: bexpanded from macro 'IMPL_COLL_FUNC'u ff, arg s391- | > r eRcuvnbWuofrfk,< n c| c ^l Func##fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:,202 :t53y:p enote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here Func# #202d | e v r e d o p < tRyupneW>o,r kNEClCeLm_eAnLtGt(o)>.(r)u.nr(uwne()&;n c c| l ^S hmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppr:k6):;1 :\ note: in instantiation 
of member function 'RunWork, 2, 2>::run' requested here| ^ 6 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:_15C:O Lnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ FUNC(A l562l | R e d u ctei,d (CtOiLdL)N,E Tn_tDhIrReEaCdTs,( nStIhMrPeLads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hthreads(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s ), tidInBlock(threadId x562. 
| x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s (| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hreads )563, | t i d IsntBelpoScikz(et(hnrcecaldSIhdmxe.mx.)c,o mgmr.obuupf(fgSriozueps)[,N C C| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ P R| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T O_SIM P563L | E ] / N CsCtLe_pSSTiEzPeS(/nsciczleSohfm(eTm).)c o{m m .| b ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u f f| S group(groupi zes[NCCL_PROTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:L666E:]9/:N Cnote: Cin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL _STEPS /666s | i z e o f ( T ) )p r{i m s| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| , group(group nThreadsGather, direct/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h-:>626u:p9,: Nnote: Uin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL L, args- >626s | e n d b u f f , parrigmss-(>triedc-vtbiudfSft,a r t| S ^c atter, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:T202h:r53e:a dnote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS catte r202, | N U L L , d iRruencWto-r>kuEpl,e maerngts<-F>ns,e nTd,b uRfefd,O pa,r gAsl-g>or,e cPvrboutfof>,( ) .| r ^u n(we); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :10:1: 202note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IRMuPnLW_oCrOkLELl_eFmUeNnCt( (S)I.MrPuLnE(,w eP)r;o d ,| ^h alf) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp^: 7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: 7note: | expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15(: note: field 'nthreads' will be initialized after field 'tidInBlock'n threads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o c k| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hreadIdx.x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~~~~~~~c lShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:c60o:m mnote: .field 'group' will be initialized after field 'stepSize'b uffSize s562[ | N C C L _tPiRdO(TtOi_dS)I,M PnLtEh]r/eNaCdCsL(_nStThErPeSa/dssi)z,e otfi(dTI)n)B l{o c k| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa dIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdbuff, arg:s562-:>15r:e cwarning: vinitializer order does not match the declaration order [-Wreorder-ctor]b uff, | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53t:i dnote: (in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested heret id), nt h202r | e a d s ( n t h rReuandWso)r,k EtliedmIennBtlo(u)p.)r,u n (| w ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e ) ;| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp : 7s:t1e:p Snote: iin instantiation of member function 'RunWork, 2, 2>::run' requested herez e(ncc l7S | hImMePmL._cCoOmLmL._bFuUfNfCS(iAzlelsR[eNdCuCcLe_,P RCOOTLOL_NSEITM_PDLIER]E/CNTC,C LS_ISMTPELPES,/ sPirzoedo,f (uTi)n)t 3{2 _ t| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:W666o:r9k:< nnote: cin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec lFunc# #666f | u n c , t y p ep,r iFmusn(ct#i#dd,e vnrTehdroepah,e rN,C CdLi_rAeLcGtO-_>#u#pa,l gNoU,L LN,C CaLr_gPsR-O>TsOe_n#d#bpurfoft,o >a(r)g.sr-u>nr(e&cnvcbculfSfh,m e m| . 
^w ork); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562202: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' RunW o562r | k E l e mteindt(().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, 
uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :C562O:L15L:N Ewarning: initializer order does not match the declaration order [-Wreorder-ctor] T_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, 
nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, 
nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), |
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562687::1511:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | 562 | p r i m st(itdi(dt-itdi)d,S tnatrhtrBecaadsst(,n tnhTrheraedasd),s BtciadsItn,B l&odcikr(etchtr-e>aoduItd,x .nxu),l lgprtoru,p (agrrgosu-p>)s,e n d| b ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u f f| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) args->r e563c | v b u f fs,t e p| S ^i ze(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m202e:m53.:c onote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem .buff S202i | z e s [ N C C L _RPuRnOWToOr_kSEIlMePmLeEn]t/ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( ) .| r group(groupu 
n(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::8655::111:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 8 | I M655P | L _ C O L L _ F U N Cp(rAilmlsR(etdiuce, COLd-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LNET_DIRECT, 
SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 15 : warning: sinitializer order does not match the declaration order [-Wreorder-ctor]t epSize(ncclShmem.c o562m | m . b u ftfiSdi(zteisd[)N,C CnLt_hPrReOaTdOs_(SnItMhPrLeEa]d/sN)C,C Lt_iSdTIEnPBSl/oscikz(etohfr(eTa)d)I d{x . 
x| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, g| r group(groupo up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:i677z:e11(:n cnote: cin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel Shmem.com m677. | b u f f S i z e s [ NpCrCiLm_sP(RtOiTdO-_tSiIdMSPtLaEr]t/BNcCaCsLt_,S TnETPhSr/esaidzseBocfa(sTt),) &{d i r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c t -| > group(groupo ut, direct->down, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:r655g:s11-:> snote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren dbuff, a r655g | s - > r e c v b u f fp,r i m| s ^( tid-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:r202t:R53e:d unote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree , nTh r202e | a d s R e d u c eR,u nnWuolrlkpEtlre,m e&ndti,o uRte,d Oapr,g sA-l>gsoe,n dPbruoftfo,> (a)r.grsu-n>(rweec)v;b u f| f ^, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202in instantiation of member function 'RunWork, 2, 2>::run' requested here: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 12 | IMP L202_ | C O L L _ F U N CR(uAnlWloRrekdEulceem,e nCtO,( )d.oruubnl(ew)e ) ;| ^ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:: 9note: :expanded from macro 'IMPL_COLL_FUNC'1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 391 | R9u | nIWMoPrLk_E,, NPCrCoLd_,A LuGiOn_t#6#4a_ltg)o , | N^C CL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_391#:#95p:r onote: texpanded from macro 'IMPL_COLL_FUNC'o >().run( &391n | c c lRSuhnmWeomr.kw , tNiCdC(Lt_iAdL)G,O _n#t#harlegaod,s (NnCtChLr_ePaRdOsT)O,_ #t#ipdrIontBol>o(c)k.(rtuhnr(e&andcIcdlxS.hxm)e,m .gwroorukp)(;g r\o u p| ) ^, | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56215::60 :note: field 'nthreads' will be initialized after field 'tidInBlock'note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, 
nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ endbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 
2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 
2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, 
direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: 
note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp::11562::115:: note: warning: in instantiation of member function 'RunWork, 2, 2>::run' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 11 | IMPL_COL L562_ | F U N C (tAildl(Rteiddu)c,e ,n tChOrLeLaNdEsT(_nDtIhRrEeCaTd,s )S,I MtPiLdEI,n BPlroocdk,( tfhlroeaatd)I d x| .^x ), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:(95g:r onote: uexpanded from macro 'IMPL_COLL_FUNC'p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)391 | RunW o563r | k < n c csltFeupnSci#z#ef(unnccc,l Sthympeem,. cFoumnmc.#b#udfefvSriezdeosp[P,R ONTCOC_LS_IAMLPGLOE_]#/#NaClCgLo_,S TNECPCSL/_sPiRzOeToOf_(#T#)p)r o{t o >| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) . 
r| u group(groupn (&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:k655):;11 :\ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562655: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' pr i562m | s ( t i dt-itdi(dtSitda)r,t Rnetdhurceea,d sn(TnhtrheraedasdRse)d,u ctei,d InnuBlllopctkr(,t h&rdeiardeIcdtx-.>xo)u,t ,g raorugps(-g>rsoeunpd)b,u f f| , ^~~~~~~~~~~~~~~~~ arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:r60e:c vnote: bfield 'group' will be initialized after field 'stepSize'u ff, | ^ 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t202i:d53):, note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hreads (202n | t h r e a d s ) ,R utniWdoIrnkBElloecmke(ntth (| ) ^~~~~~~~~~~. 
run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkd,) , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, 
args->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) endbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthread{s ) ,| 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 687 :| 11 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | st e687p | S i z e ( n c c l S hpmreimm.sc(otmimd.-btuifdfSStiazretsB[cNaCsCtL,_ PnRTOhTrOe_aSdIsMBPcLaEs]t/,N C&CdLi_rSeTcEtP-S>/osuitz,e onfu(lTl)p)t r{, a| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g s -| > group(groups endbuff, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e677c:v11b:u fnote: fin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^ 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 :p rnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem s(tid-t i202d | S t a r t B c a sRtu,n WnoTrhkrEelaedmseBncta,o uAtl,g od,i rPercott-o>>d(o)w.nr,u na(rwges)-;> s e| n ^d buff, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp-:>12r:e1c:v bnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heref f, | ^ 12 | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:L202L:_53F:U Nnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( AllRed u202c | e , C O L L N ERTunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, 
rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, 
NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx900. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx940. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -MF 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 
666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, 
nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, 
nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hPROTO:_562S:I15M:P Lwarning: Einitializer order does not match the declaration order [-Wreorder-ctor]] /NCCL_STEPS/sizeof(T)) 562{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a666d:I9d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , group(gr o666u | p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ p| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i ms(tid, 563n | T h r e asdtseGpaStihzeer(,n cdcilrSehcmte-m>.ucpo,m mN.UbLuLf,f Sairzgess-[>NsCeCnLd_bPuRfOfT,O _aSrIgMsP-L>Er]e/cNvCbCuLf_fS,T E P| S ^/ sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :{202 : 53| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| group(group 202 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hW:o677r:k11E:l enote: min instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested heree nti(d)-.triudnS(twaer)t;B c a| s ^t , nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppB:c5a:s1t:, note: &in instantiation of member function 'RunWork, 2, 2>::run' requested hered irect -5> | oIuMtP,L _dCiOrLeLc_tF-U>NdCo(wAnl,l Raerdgusc-e>,s eCnOdLbLuNfEfT,_ DaIrRgEsC-T>,r eScIvMbPuLfEf,, M a| x ^, uint8_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 202 :| 53^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: 202note: | expanded from macro 'IMPL_COLL_FUNC' R u391n | W o rRkuEnlWeomrekn#(d)e.vrruend(owpe<)t;y p e| > ^, NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppG:O5_:#1#:a lnote: gin instantiation of member function 'RunWork, 2, 2>::run' requested hereo , NCCL _5P | RIOMTPOL__#C#OpLrLo_tFoU>N(C)(.ArlulnR(e&dnucccel,S hCmOeLmL.NwEoTr_kD)I;R E\C T ,| ^S IMPLE, Max,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:n15t:8 _note: tfield 'nthreads' will be initialized after field 'tidInBlock') | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95t:i dnote: (expanded from macro 'IMPL_COLL_FUNC't id), nthr e391a | d s (RnutnhWroerakd , | N ^~~~~~~~~~~~~~~~~C CL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O60_:# #note: afield 'group' will be initialized after field 'stepSize'l go, NCC L562_ | P R O T Ot_i#d#(ptriodt)o,> (n)t.hrruena(d&sn(cnctlhSrhemaedms.)w,o rtki)d;I n\B l o| c ^k (thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,note: field 'nthreads' will be initialized after field 
'tidInBlock'g roup(gr o562u | p ) , t| i ^~~~~~~~~~~d (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, 
nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation 
of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note:
in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of 
member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation 
of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, 
direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]t id(tid), nthreads(nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k ( t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eadIdx .563x | ) , g rsotuepp(Sgirzoeu(pn)c,c l S| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~m e m| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c omm.buf f563S | i z e s [sNtCeCpLS_iPzReO(TnOc_cSlISMhPmLeEm]./cNoCmCmL._bSuTfEfPSSi/zseisz[eNoCfC(LT_)P)R O{T O _| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I M P| L group(groupE ]/NCCL_STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:f626(:T9):) note: {in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 626 | prims(tid-tidStart/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:c641a:t11t:e rnote: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here nThreadsS c641a | t t e r , N U L L ,p rdiimrse(ctti-d>-utpi,d Satragrst-R>esdeuncdeb,u fnfT,h raeragdss-R>erdeuccveb,u fdfi,r e c| t ^- >down, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:d202i:r53e:c tnote: -in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here> out, a202r | g s - > s e n d bRuufnfW,o rakrEglse-m>ernetc:( )note: .in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer un(we) ;202 | | ^ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppn:W5o:r1k:E lnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herem entc(e),. 
rCuOnL(LwNeE)T;_ D I| R ^E CT, SIMPLE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :M8a:x1,: uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ gs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:c562o:m15m:. bwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]f fSizes[NCCL_PROTO_ S562I | M P L E ]t/iNdC(CtLi_dS)T,E PnSt/hsriezaedosf((nTt)h)r e{a d s| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t| i group(groupd InBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~666 : 9| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 666 | s t e p S i z ep(rnicmcsl(Sthimde,m .ncTohmrme.abdusfGfaStihzeers,[ NdCiCrLe_cPtR-O>TuOp_,S INMUPLLLE,] /aNrCgCsL-_>SsTeEnPdSb/usfifz,e oafr(gTs)-)> r{e c v| b ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u f f| , group(group | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::641202::1153:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:O562_:#15#:p rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]t o>().run(&ncclSh m562e | m . 
w o rtki)d;( t\i d )| , ^ nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's ), tidI n562B | l o ck ( tthirde(atdi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hId:d)562x,:. 15xn:)t ,hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]ge raodusp((ng tr562ho | ru ep a) d, s t) i,| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~(t ti id| dI tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T))n ,B lnotch kr563(e | ta hd rs e( ansdttIhedrpxeS.aixdz)se,)( ,ng crtcoiludSpIh(nmgBerlmoo.uccpko)(m,tm h. rb| eu ^~~~~~~~~~~~~~~~~af dfIS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdi:xz562.e:xs60)[:,N Cnote: gCfield 'group' will be initialized after field 'stepSize'rL o_uPp R(562Og | Tr Oo _u Sp I)tM,iP dL (E| t] ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i/ dN )C| ,C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) L n_tSh Tr563Ee | Pa Sd /s s( insztteheorpfeS(aiTdz)se))( ,n{ c tc il| dS ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ih nm Be| lm group(groupo. 
ccko(mtmh.rbeuafdfISd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hix:z.626ex:s)9[,:N Cgnote: Crin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereLo _uPpR(OgTr Oo626_u | Sp I) M, P L E| ] ^~~~~~~~~~~ / NpCrCiLms_(StTiEdP-St/isdiSzteaorft(STc)a)t t{e r ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nT h r| e group(groupa dsScatter, NUL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:,626 :d9i:r enote: cin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret ->up, ar g626s | - > s e n d b u fpfr,i masr(gtsi-d>-rteicdvSbtuafrft,S c a| t ^t er, nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202a:d53s:S cnote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret ter, N202U | L L , d i r e cRtu-n>Wuop, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ]warning: /initializer order does not match the declaration order [-Wreorder-ctor]N CCL_STEPS/sizeof(T )562) | { | t 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d (| t group(groupi d), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h626r:e9a:d snote: )in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, tidInBl o626c | k ( t h r e a d Ipdrxi.mxs)(,t igdr-otuipd(SgtraorutpS)c,a t t| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r , | n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T hreads S563c | a t t e rs,t eNpUSLiLz,e (dnicrcelcSth-m>eump.,c oamrmg.sb-u>fsfeSnidzbeusf[fN,C CaLr_gPsR-O>TrOe_cSvIbMuPfLfE,] / N| C ^C L_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z202e:o53f:( Tnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~202 | | group(group RunWorkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:<641F:n11,: Tnote: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RedOp, A l641g | o , P r o t o > ( )p.rriumns((wtei)d;- t i| d ^S tartReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:,9 :n1T:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea dsRedu c9e | ,I MdPiLr_eCcOtL-L>_dFoUwNnC,( A&ldliRreedcutc-e>,o uCtO,L LaNrEgTs_-D>IsReEnCdTb,u fSfI,M PaLrEg,s -M>arxe,c vubiunftf6,4 _ t| ) ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h53::391 :note: 95in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 
note: expanded from macro 'IMPL_COLL_FUNC' 202 | 391 | RRuunnWWoorrkkEe(d)o.pr),; N C| C ^L _ALGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp#:a5l:g1: dnote: in instantiation of member function 'RunWork, 2, 2>::run' requested heres ), tidInBlock (5t | hIrMePaLd_ICdOxL.Lx_)F,U NgCr(oAulpl(Rgerdouucpe),, C O| L ^~~~~~~~~~~~~~~~~L NET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hco:m562m:.15b:u fwarning: finitializer order does not match the declaration order [-Wreorder-ctor]S izes[NCCL_PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r666o:u9p:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 666 | 563 | p rsitmesp(Stiizde,( nncTchlrSehamdesmG.actohmemr.,b udfifrSeiczte-s>[uNpC,C LN_UPLRLO,T Oa_rSgIsM-P>LsEe]n/dNbCuCfLf_,S TaErPgSs/-s>irzeecovfb(uTf)f), { | ^| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: 202note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666R | u n W o r k E l epmreinmts<(Ftni,d ,T ,n TRherdeOapd,s GAaltghoe,r ,P rdoitroe>c(t)-.>ruupn,( wNeU)L;L , | a ^r gs->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppb:u5f:f1,: anote: rin instantiation of member function 'RunWork, 2, 2>::run' 
requested hereg s->re c5v | bIuMfPfL,_ C O| L ^L _FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:l202R:e53d:u cnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, COLL N202E | T _ D I R E C T ,R uSnIWMoPrLkEE,l eMmaexn,t note: (expanded from macro 'IMPL_COLL_FUNC') .run(we); 391 | | ^ RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp<:n5c:c1l:F unote: nin instantiation of member function 'RunWork, 2, 2>::run' requested herec ##fun c5, | ItMyP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pe, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :687562 | : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] prims(tid-tid S562t | a r t B ctaisdt(,t indT)h,r enatdhsrBecaadsst(,n t&hdrieraedcst)-,> otuitd,I nnBullolcpkt(rt,h raeragdsI-d>xs.exn)d,b ugfrfo,u pa(rggrso-u>pr)e,c v b| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~f f ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, 
NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of 
member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL _562A | L G O _ #t#iadl(gtoi,d )N,C CnLt_hPrReOaTdOs_(#n#tphrroetaod>s()),. rtuind(I&nnBclcolcSkh(mtehmr.ewaodrIkd)x;. 
x\) , | g ^r oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562(:n15c:c lnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'h mem.comm.buffSizes[N C562C | L _ P R OtTiOd_(StIiMdP)L,E ]n/tNhCrCeLa_dSsT(EnPtSh/rseiazdeso)f,( Tt)i)d I{n B l| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c k (| t group(grouph readIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^~~~~~~~~~~~~~~~~655 :11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 :60: note: field 'group' will be initialized after field 'stepSize' 655 | 562 | tpirdi(mtsi(dt)i,d -nttihdrSetaadrst(Rnetdhurceea,d sn)T,h rteiaddIsnRBeldouccke(,t hnruelaldpItdrx,. 
x&)d,i rgercotu-p>(ogurto,u pa)r,g s -| > ^~~~~~~~~~~s endbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT):)562 :{15 : | warning: 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~initializer order does not match the declaration order [-Wreorder-ctor] | group(group 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) h 563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. 
b u| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)f Sizes[N C563C | L _ P R OsTtOe_pSSIiMzPeL(En]c/cNlCSChLm_eSmT.EcPoSm/ms.ibzuefoffS(iTzes[)N)C C{L _ P| R ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O T O| _ group(groupS IMPLE]/NCCL_STEPS/sizeof(T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):)687 :{11 : | note: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | group(group 687 | prims(tid-tidStartBcast, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:T655h:r11e:a dnote: sin instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereB cast, &d i655r | e c t - > o u t , npurlilmpst(rt,i da-rtgisd-S>tsaerntdRbeudfufc,e ,a rngTsh-r>eraedcsvRbeudfufc,e , | n ^u llptr, &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:r202e:c53t:- >note: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu t, ar g202s | - > s e n d b u fRfu,n WaorrgksE-l>ermeecnvtb, 2, 2>::run' requested here> ().run (202w | e ) ; | ^ RunWorkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:m6e:n1t:< Fnote: nin instantiation of member function 'RunWork, 2, 2>::run' requested here, T, R e6d | OIpM,P LA_lCgOoL,L _PFrUoNtCo(>A(l)l.Rreudnu(cwee,) ;C | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ OLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hthr:e562a:d60s:) ,note: field 'group' will be initialized after field 'stepSize't idInBlock(t h562r | e a d I dtxi.dx()t,i dg)r,o up(ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~h reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 15 group(group: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 562 | 666 | t i d ( t i dp)r,i mnst(htrieda,d sn(TnhtrheraedasdGsa)t,h etri,d IdniBrleocctk-(>tuhpr,e aNdUILdLx,. xa)r,g sg-r>osuepn(dgbruofufp,) ,a r g| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~- > r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c vbuff, 563| | ^ stepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z202e:(53n:c cnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS hmem. c202o | m m . b u f f S iRzuensW[oNrCkCELl_ePmReOnTtO<_FSnI,M PTL,E ]R/eNdCOCpL,_ SATlEgPoS,/ sPirzoetoof>((T)).)r u{n ( w| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:A641l:l11R:e dnote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec e, COLLN E641T | _ D I R E C T , S IpMrPiLmEs,( tMiadx-,t iudiSntta3r2t_Rte)d u c| e^, nThr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95R:e dnote: uexpanded from macro 'IMPL_COLL_FUNC'c e, direc t391- | > d oRwunn,W o&rdkiuonuct#,# faurngcs,- >tsyepned,b uFfufn,c #a#rdgesv-r>erdeocpv , | N ^C CL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:l202g:o53,: Nnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC L_PRO T202O | _ # # p r o t o >R(u)n.Wrournk(E&lnecmcelnSth().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgs:-562>:r15e:c vwarning: binitializer order does not match the declaration order [-Wreorder-ctor]u ff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202: 53562: | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tid(t i202d | ) , n t h r e aRdusn(WnotrhkrEelaedmse)n,t r(o)u.pr(ugnr(owuep));, | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here563 | s t7e | pISMiPzLe_(CnOcLcLl_SFhUmNeCm(.AclolmRme.dbuucfef,S iCzOeLsL[NNECTC_LD_IPRREOCTTO,_ SSIIMMPPLLEE],/ NMCaCxL,_ SuTiEnPtS3/2s_itz)e o f| (^T )) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~391 : 95| : group(group note: expanded from macro 'IMPL_COLL_FUNC' 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hR:u626n:W9o:r knote: , FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren cclFunc# #626f | u n c , t y p ep,r iFmusn(ct#i#dd-etvirdeSdtoaprt,e rN,C CnLT_hArLeGaOd_s#S#caaltgtoe,r ,N CNCULL_LP,R OdTiOr_e#c#tp-r>outpo,> (a)r.grsu-n>(s&enncdcbluSfhfm,e ma.rwgosr-k>)r;e c\v b u| f ^f , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202note: | field 'nthreads' will be initialized after field 'tidInBlock' R u562n | W o r k Etliedm(etnitd<)F,n ,n tTh,r eRaeddsO(pn,t hArlegaod,s )P,r ottiod>I(n)B.lroucnk((wteh)r;e a d| I ^d x.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppo:u8p:(1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep ), | ^~~~~~~~~~~~~~~~~ 8 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:_60C:O LL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::20215::53 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 562 | R u ntWiodr(ktEilde)m,e nnttc(k)(.trhurne(awdeI)d;x . x| ) ^, group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:p8):,1 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~in instantiation of member function 'RunWork, 2, 2>::run' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 8 | IM P563L | _ C O L Ls_tFeUpNSCi(zAel(lnRcecdluSchem,e mC.OcLoLmNmE.Tb_uDfIfRSEiCzTe,s [SNICMCPLL_EP,R OMTaOx_,S IiMnPtL6E4]_/tN)C C L| _^S TEPS/sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:(391T:)95): {note: expanded from macro 'IMPL_COLL_FUNC' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 391 | RunWork, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# func, t y666p | e , F u n c # #pdreivmrse(dtoipd<,t ynpTeh>r,e aNdCsCGLa_tAhLeGrO,_ #d#iarlegcot,- >NuCpC,L _NPURLOLT,O _a#r#gpsr-o>tsoe>n(d)b.urfufn,( &anrcgcsl-S>hrmeecmv.bwuofrfk,) ; | 
\ ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 202note: | field 'nthreads' will be initialized after field 'tidInBlock' Ru n562W | o r k E lteimde(nttid(I)n.Brluonc(kw(et)h;r e a| d ^I dx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:p11(:g1r:o unote: pin instantiation of member function 'RunWork, 2, 2>::run' requested here) , | ^~~~~~~~~~~~~~~~~ 11 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:_60C:O Lnote: Lfield 'group' will be initialized after field 'stepSize'_ FUNC(Al l562R | e d u c et,i dC(OtLiLdN)E,T _nDtIhRrEeCaTd,s (SnItMhPrLeEa,d sM)a,x ,t ifdlIonaBtl)o c k| (^t hreadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x391.:x95):, note: gexpanded from macro 'IMPL_COLL_FUNC'r oup(group )391, | | R ^~~~~~~~~~~u nWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: 563note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here st e8p | 
SIiMzPeL(_nCcOcLlLS_hFmUeNmC.(cAolmlmR.ebduufcfeS,i zCeOsL[LNNCECTL__DPIRROETCOT_,S ISMIPMLPEL]E/,N CMCaLx_,S TiEnPtS6/4s_itz)e o f| (^T )) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 391 :| 95 group(group: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref unc, type, 655F | u n c # # d e v r e dporpid,- tNiCdCSLt_aArLtGROe_d#u#cael,g on,T hNrCeCaLd_sPRReOdTuOc_e#,# pnruoltlop>t(r),. r&udni(r&encctc-l>Sohumte,m .awrogrsk-)>;s e\n d b| u ^f f, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:r15e:c vnote: bfield 'nthreads' will be initialized after field 'tidInBlock'u ff, | ^ 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:)53,: nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh reads( n202t | h r e a d s ) , RtuindWIonrBklEolcekm(etnhtr| ( ^~~~~~~~~~~~~~~~~) .run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:w562e:)60;: note: | field 'group' will be initialized after field 'stepSize' ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 8 : 1t:i dnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heret id), n t8h | rIeMaPdLs_(CnOtLhLr_eFaUdNsC)(,A ltliRdeIdnuBcleo,c kC(OtLhLrNeEaTd_IDdIxR.ExC)T,, gSrIoMuPpL(Eg,r oMuapx),, i n| t ^~~~~~~~~~~6 4_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 
2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f15,: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | t i202d | ( t i d ) , n tRhurneWaodrsk(Enltehmreenatd)(,) .grruonu(pw(eg)r;o u p| ) ^, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: 563note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here st e12p | SIiMzPeL(_nCcOcLlLS_hFmUeNmC.(cAolmlmR.ebduufce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ un(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWorkElement <562F | n , T ,t iRde(dtOipd,) ,A lngtoh,r ePardost(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:o15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>().run (:562w562 | e: )15 ;: warning: t| initializer order does not match the declaration order [-Wreorder-ctor]i ^ d (tid), nthreads(nt h562r | e a d s )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppt,:i 9dt:(i1tdiIdn)B,:l onnote: ctin instantiation of member function 'RunWork, 2, 2>::run' requested herekh (rteha rd9es | a(IdnMItPdhLxr_.eCxaO)dL,sL )_g,Fr UotNuiCpd((IAgnlrBlolRuoepcd)ku,(c te h,| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C O L| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N ET_DIR E563C | T , S IsMtPeLpES,i zMea(xn,c 
culiSnhtm6e4m_.tc)o m m| .^b uffSizes[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:P391R:O95T:O _note: Sexpanded from macro 'IMPL_COLL_FUNC'I MPLE]/NCCL _391S | T E PRSu/nsWiozreko687,: 11N:C Cnote: Lin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ ALGO_## a687l | g o , N C C L _ P RpOrTiOm_s#(#tpirdo-ttoi>d(S)t.arrutnB(c&ansctc,l SnhTmherme.awdosrBkc)a;s t\, &| d ^i rect->out, nullp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:r562,: 15a:r gnote: sfield 'nthreads' will be initialized after field 'tidInBlock'- >sendbuff, 562a | r g s - >triedc(vtbiudff, | ^ )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s(nth r202e | a d s ) , t i dRIunnBWloorckkE(ltehmreenatd()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:r562u:n60(:w enote: )field 'group' will be initialized after field 'stepSize'; | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 8t:i1d:( tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered ), n t8h | rIeMaPdLs_(CnOtLhLr_eFaUds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:-562>:d15o:w nwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] args->sendbuff, args->rec 
v562b | u f f , t i| d ^( tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202a:d53s:( nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh reads) ,202 | t i d I n B l o cRku(ntWhorrekaEdlIedmxe.nxt)<,F ng,r oTu,p (RgerdoOupp,) ,A l g| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, P| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o to>(). r563u | n ( w stepSize(en)c;c l S| h ^m em.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:f9f:S1i:z enote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here[ NCCL_ P9R | OITMOP_LS_ICMOPLLLE_]F/UNNCCC(LA_lSlTREePdSu/csei,z eCoOfL(LTN)E)T _{D I R| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C T ,| group(groupS IMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::95626:: 9note: :expanded from macro 'IMPL_COLL_FUNC' note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | 626 | R u n W o r k < npcrcilmFsu(ntci#d#-ftuindcS,t atrytpSec,a tFtuenrc,# #ndTehvrreeaddospSr,, NNCUCLLL_,A LdGiOr_e#c#ta-l>guop,, NaCrCgLs_-P>RsOeTnOd_b#u#fpfr,o taor>g(s)-.>rruenc(v&bnucfcfl,S h m| e ^m .work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(group), :| 202 ^~~~~~~~~~~: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 
note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :field 'nthreads' will be initialized after field 'tidInBlock'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~p (gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)60,: note: | field 'group' will be initialized after field 'stepSize' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563t | i d ( t isdt)e,p Snitzher(enacdcsl(Snhtmherme.acdosm)m,. 
btuifdfISniBzleosc[kN(CtChLr_ePaRdOITdOx_.SxI)M,P LgEr]o/uNpC(CgLr_oSuTpE)P,S / s| i ^~~~~~~~~~~z eof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i15d:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]nt hreads(nthreads), t i562d | I nB l o ckt(itdhr(etaiddI)d,x .nxt)h,r egardosu(pn(gtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t id/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o:c knote: (field 'group' will be initialized after field 'stepSize't hreadI d562x | . 
x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s ( n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h reads), 563t | i d I n BsltoecpkS(itzher(enacdcIldSxh.mxe)m,. cgormomu.pb(ugfrfoSuipz)e,s [ N| C ^~~~~~~~~~~C L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), nthreads (562n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d x.x), g563r | o u p ( gsrtoeuppS)i,z e (| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c c l| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h mem.co m563m | . 
b u f fsStiezpeSsi[zNeC(CnLc_cPlRSOhTmOe_mS.IcMoPmLmE.]b/uNfCfCSLi_zSeTsE[PNSC/CsLi_zPeRoOfT(OT_)S)I M{P L E| ] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/ N C| C group(groupL _STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : group(group677 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :677677 | : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here pri m677s | ( t i d - t i d S t aprrtiBmcsa(stti,d -ntTihdrSetaadrstBBccaasstt,, &ndTihrreecatd-s>Bocuats,t ,d i&rdeicrte-c>td-o>wonu,t ,a rdgisr-e>cste-n>ddbouwfnf,, aarrggss-->>sreencdvbbuuffff,, a r| g ^s ->recvb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f202f:,53 : | note: ^in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202: | 202 : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here RunW o202r | k E l e m e n t p(,) .Arlugo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 |
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads), tidInBlock(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s), tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s [ N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_PROTO _563S | I M P L Es]t/eNpCSCiLz_eS(TnEcPcSl/Sshimzeemo.fc(oTm)m). b{u f f| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i z e| s group(group[ NCCL_PROTO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h]:/626N:C9C:L _note: Sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT EPS/siz e626o | f ( T ) ) { p| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i m s| ( group(groupt id-tidStartScatter, nThreadsS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:a655t:t11e:r ,note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereN ULL, dire c655t | - > u p , a r g s -p>rsiemnsd(btuifdf-,t iadrSgtsa-r>trReecdvubcuef,f ,n T h| r ^e adsRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:u202c:e53,: nnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel lptr, 202& | d i r e c t - > oRuutn,W oarrkgEsl-e>mseenntdpr,e cAvlbguof,f ,P r o| 
t ^o >().run(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ^: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11: 1202: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11R | uInMWPoLr_kCEOlLeLm_eFnUtNTd(,() t.SirIduM)nP,(L wEne,t) h;Mr ae xa| ,d ^ s f(lno/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppta:ht10r):e 1a :d| s^note: ) in instantiation of member function 'RunWork, 2, 2>::run' requested here, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht: i39110d: | I95In:MB Plnote: Loexpanded from macro 'IMPL_COLL_FUNC'_c CkO( Lt391Lh | _r Fe UaRNduCIn(dWAxol.rlxkR), NCnCcLcl_SAhLmGeOm_.#c#oamlmg.ob,u fNfCSCiLz_ePsR[ONTCOC_L#_#PpRrOoTtOo_>S(I)M.PrLuEn](/&NnCcCcLl_SShTmEePmS./wsoirzke)o;f (\T ) )| ^{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h group(group: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 666 : 9 :t inote: din instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), n t666h | r e a d s ( n t hprreiamdss()ti,d ,t indTIhnrBelaodcskG(atthhreer,a ddIidrxe.cxt)-,> ugpr,o uNpU(LgLr,o uapr)g,s - >| s ^~~~~~~~~~~~~~~~~e ndb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:f562f:,60 :a rnote: gfield 'group' will be initialized after field 'stepSize's ->recv b562u | f f , t| i ^d (tid), 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t202:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres (nthr e202a | d s ) , t i d IRnuBnlWoocrkk(EtlhermeeandtI().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of 
member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: 
note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, 
nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member 
function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:o562u:b15l:e )warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | t391i | d ( tRiudn)W,o rnkt ,g rNoCuCpL(_gArLoGuOp_)#,# a l| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o , | N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C CL_PROT O563_ | # # p r osttoe>p(S)i.zreun((n&cncclcSlhSmhemme.mc.owmomr.kb)u;f f\S i z| e ^s [NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:O562T:O15_:S 
Inote: Mfield 'nthreads' will be initialized after field 'tidInBlock'P LE]/NCC L562_ | S T E P S/tsiidz(etoifd()T,) )n t{h r e| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ( group(groupn threads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:l626o:c9k:( tnote: hin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadIdx. x626) | , g r o u p ( gprroiumps)(,t i d| - ^~~~~~~~~~~~~~~~~t idS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:a562r:t60S:c anote: tfield 'group' will be initialized after field 'stepSize' ter, n T562h | r e a d stSicda(tttiedr),, NnUtLhLr,e addisr(enctth-r>euapd,s )a,r gtsi-d>IsneBnldobcukf(ft,h raeragdsI-d>xr.exc)v,b ugfrfo,u p (| g ^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^~~~~~~~~~~202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:r15e:c vwarning: binitializer order does not match the declaration order [-Wreorder-ctor]u ff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :56253 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tid(ti d202) | , n t h r e a dRsu(nnWtohrrkeEaldesm)e,n ttu(p)(.grruonu(pw)e,) ;| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^| tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13 :5631 | : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here stepS i13z | eI(MnPcLc_lCSOhLmLe_mF.UcNoCm(mA.lbluRfefdSuiczee,s[ NCCOCLLL_NPERTO_TDOI_RSEICMTP,L ES]I/MNPCLCEL,_ SMTaExP,S /rsciczle_obff(lTo)a)t 1{6 ) | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655 :39111 | : note: Rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu nWorke,a dNsCRCeLd_uAcLeG,O _n#u#lallpgtor,, N&CdCiLre_cPtR-O>ToOu_t#,# parrogtso->>s(e)n.drbuunf(f&,n cacrlgSsh-m>erme.cwvobrukf)f;, \ | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202562 | | t iRdu(ntWiodr)k,E lnetmhernetah(r)e.arduInd(xw.ex));, g| r ^o up(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppp:)13,: 1 :| ^~~~~~~~~~~~~~~~~note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 6013: | Inote: Mfield 'group' will be initialized after field 'stepSize'P L_COLL _562F | U N C ( AtlildR(etdiudc)e,, nCtOhLrLeNaEdTs_(DnItRhErCeTa,d sS)I,M PtLiEd,I nMBalxo,c 
kr(ctchlr_ebafdlIodaxt.1x6)), g| r^o up(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:)391,: 95 :| ^~~~~~~~~~~note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, 
nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: 
note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ax, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset 
= tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 15 group(group: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i626d:)9,: nnote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads(nt h626r | e a d s ) , t 
ipdrIinmBsl(otcikd(-tthirdeSatdaIrdtxS.cxa)t,t egrr,o unpT(hgrreoaudps)S,c a t| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e r ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N ULL, di r563e | c t - > uspt,e paSrigzse-(>nscecnldSbhumfefm,. caormgms.-b>urfefcSvibzuefsf[,N C C| L ^_ PROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:M202P:L53E:] /note: Nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC CL_STE P202S | / s i z e o f ( TR)u)n W{o r k| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l e m| e group(groupn t, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereP roto>() .666r | u n ( w e ) ; p| r ^i ms(tid,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :n4T:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested heres Gathe r4, | IdMiPrLe_cCtO-L>Lu_pF,U NNCU(LALl,l Raerdgusc-e>,s eCnOdLbLuNfEfT,_ DaIrRgEsC-T>,r eScIvMbPuLfEf,, M i| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(In file included from nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:i1d: InIn file included from B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:o10c: kIn file included from 
(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hh:r167e: a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )warning: , initializer order does not match the declaration order [-Wreorder-ctor]g roup(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member 
function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 
7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ kElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, 
NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:)562,: 15n:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ads(nthreads), 562t | i d I n Btliodc(k(ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t, i d| I ^~~~~~~~~~~n Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562{: 15 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~warning: initializer order does not match the declaration order [-Wreorder-ctor] | group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641t:i11d:( tnote: iin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nthrea d641s | ( n t h r e a d s ) ,p rtiimdsI(ntBildo-ctki(dtShtraeratdRIeddxu.cxe),, ngTrhoruepa(dgsrRoeudpu)c,e , | d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i r e| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t ->down ,563 | & d i r esctte-p>Soiuzte,( nacrcglsS-h>mseemn.dcboumfmf.,b uafrfgSsi-z>erse[cNvCbCuLf_fP,R O T| O ^_ SIMPLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202C:L53_:S Tnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereP S/siz e202o | f ( T ) ) { R| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n W o| r group(groupk Element, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo >().run (687w | e ) ; | ^ prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp(:t6i:d1-:t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested hereS tartBc a6s | tI,M PnLT_hCrOeLaLd_sFBUcNaCs(tA,l l&Rdeidrueccet,- >CoOuLtL,N EnTu_lDlIpRtErC,T ,a rSgIsM-P>LsEe,n dMbiunf,f ,i natr3g2s_-t>)r e c| v^b uff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 391in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | Run W202o | r k < n c c l F uRnucn#W#ofruknEcl,e mteynpte<,F nF,u nTc,# #RdeedvOrpe,d oAplr,o NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce,
direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: 
note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, 
direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of 
member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h7::5621::15 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herewarning: initializer order does not match the declaration order [-Wreorder-ctor] 7 | IMPL_COLL_FUNC(AllReduce, COLL N562E | T _ D I RtEiCdT(,t iSdI)M,P LnEt,h rMeiand,s (unitnhtr3e2a_dts)) , | t^i dInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k391(:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd Idx.x), g391r | o u pR(ugnrWoourpk)<,n c c| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~F u n| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #func, t563y | p e , Fsutnecp#S#idzeev(rnecdcolpSc,o mNmC.CbLu_fAfLSGiOz_e#s#[aNlCgCoL,_ 
PNRCOCTLO__PSRIOMTPOL_E#]#/pNrCoCtLo_>S(T)E.PrSu/ns(i&znecocfl(STh)m)e m{. w o| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ) ;| group(group\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 666 : 9 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), n t666h | r e a d s ( n t hprreiamdss()t,i dt,i dnITnhBrleoacdks(Gtahtrheeard,I ddxi.rxe)c,t -g>ruopu,p (NgUrLoLu,p )a,r g s| - ^~~~~~~~~~~~~~~~~> sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:b562u:f60f:, note: afield 'group' will be initialized after field 'stepSize'r gs->re c562v | b u f f ,t i d| ( ^t id), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:s53(:n tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eads), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~o , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hWorkElem:e562nt:<15F:n , warning: T,initializer order does not match the declaration order [-Wreorder-ctor] R edOp, Algo, Proto>().run(we); 562 | | ^ tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppB:l7o:c1k:( tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eadIdx.x), 7g | rIoMuPpL(_gCrOoLuLp_)F,U N C| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~A l l| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e duce, C O563L | L NE T _ DsItReEpCSTi,z eS(InMcPcLlES,h mMeimn.,c oumimn.tb3u2f_ftS)i z e| s^[ NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_391S:I95M:P Lnote: Eexpanded from macro 'IMPL_COLL_FUNC'] /NCCL_STE P391S | / s iRzuenoWfo(rTk)<)n c{c l F| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n c #| # group(groupf unc, type, Func##devredop, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_677A:L11G:O _note: #in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# algo, NCC L677_ | P R O T O _ # # p r optroi>m(s)(.triudn-(t&indcSctlaSrhtmBecma.swto,r kn)T;h r\e a d| s ^B cast, &dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:t562-:>15o:u tnote: ,field 
'nthreads' will be initialized after field 'tidInBlock' direct->d o562w | n , a rtgisd-(>tsiedn)d,b unftfh,r eaardgss(-n>trhercevabdusf)f,, t i| d ^I nBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202I:d53x:. xnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, group (202g | r o u p ) , | R ^~~~~~~~~~~~~~~~~u nWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562E:l60e:m enote: nfield 'group' will be initialized after field 'stepSize't e(a)d.sr(unnt(hwree)a;d s )| , ^ tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppc:k6(:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered Idx.x )6, | IgMrPoLu_pC(OgLrLo_uFpU)N,C ( A| l ^~~~~~~~~~~l Reduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:n15t:6 4warning: _initializer order does not match the declaration order [-Wreorder-ctor]t ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | ti d391( | t i dR)u,n Wnotrhkrr,o uNpC(CgLr_oAuLpG)O,_ # #| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l g o| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) NCCL_PR O563T | O _ # # psrtoetpoS>i(z)e.(rnucnc(l&Snhcmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clShmem. work); \ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid), nthread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d x .| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , grou p563( | g r o u ps)t,e p S| i ^~~~~~~~~~~~~~~~~z e(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S60h:m enote: mfield 'group' will be initialized after field 'stepSize'. comm.b u562f | f S i z etsi[dN(CtCiLd_)PRO,T On_tShIrMePaLdEs](/nNtChCrLe_aSdTsE)P,S /tsiidzIenoBfl(oTc)k)( t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I d| x group(group. 
x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:I15d:x .warning: x)initializer order does not match the declaration order [-Wreorder-ctor], group(group), | ^~~~~~~~~~~~~~~~~ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 60 :t inote: dfield 'group' will be initialized after field 'stepSize'( tid), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p 
(group) ,563 | | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:f562f:,15 :a rwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]s ->recvbuff, | ^ 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:)53,: nnote: tin instantiation of member 
function 'RunWorkElement, 2, 2>::run' requested hereh reads( n202t | h r e a d s ) , RtuindWIonrBklEolcekm(etnhtr| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) . r| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n (we); | 563 ^ | stepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppi:z9e:(1n:c cnote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereS hmem. c9o | mImM.PbLu_fCfOSLiLz_eFsU[NNCC(CALl_lPRReOdTuOc_eS,I MCPOLLEL]N/ENTC_CDLI_RSETCETP,S /SsIiMzPeLoEf,( TM)i)n ,{ u i| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t 6 4| _ group(groupt ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h391::65595::11 :note: expanded from macro 'IMPL_COLL_FUNC'note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391655 | | R u n W o r k < npcrcilmFsu(ntci#d#-ftuindcS,t atrytpRee,d uFcuen,c #n#TdherveraeddsoRpe ,n uNlClCpLt_rA,L G&Od_i#r#eacltg-o>,o uNtC,C La_rPgRsO-T>Os_e#n#dpbruoftfo,> (a)r.grsu-n>(r&enccvcbluSfhfm,e m .| w ^o rk); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 :202 | note: field 'nthreads' will be initialized after field 'tidInBlock' R u562n | W o r k Etliedm(etnitd<)F,n ,n tTh,r eRaeddsO(pn,t hArlegaod,s )P,r ottiod>I(n)B.lroucnk((wteh)r;e a d| I ^d x.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppo:u8p:(1g:r onote: uin instantiation of member 
function 'RunWork, 2, 2>::run' requested herep ), | ^~~~~~~~~~~~~~~~~8 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562_:C60O:L Lnote: _field 'group' will be initialized after field 'stepSize'F UNC(Al l562R | e d u c et,i dC(OtLiLdN)E,T _nDtIhRrEeCaTd,s (SnItMhPrLeEa,d sM)i,n ,t iidnItn6B4l_otc)k ( t| h^r eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:.391x:)95,: gnote: rexpanded from macro 'IMPL_COLL_FUNC'o up(group )391, | | R ^~~~~~~~~~~u nWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, 
nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(All/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 
7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, N562C | C L _ P RtOiTdO(_t#i#dp)r,o tnot>h(r)e.ardusn((n&tnhcrcelaSdhsm)e,m .twiodrIkn)B;l o\c k (| t ^h readIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(group )562, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( tid), n t563h | r e a d ss(tnetphSriezaed(sn)c,c ltSihdmIenmB.lcoocmkm(.tbhurfefaSdIdx.ixz)e,s [gNrCoCuLp_(PgRrOoTuOp_)S,I M P| L ^~~~~~~~~~~~~~~~~E ]/NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:S60T:E Pnote: Sfield 'group' will be initialized after field 'stepSize'/ sizeof( T562) | ) { t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d ( t| i group(groupd ), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r626e:a9d:I dnote: xin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. x), group (626g | r o u p ) , | p ^~~~~~~~~~~r ims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | 
prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, 
direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: 
in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562: | 562 : 15 : twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d (tid), nthreads(nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx.x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c l S| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m em.com m563. | b u f f SsitzeepsS[iNzCeC(Ln_cPcRlOShmem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]T O_SIMPLE]/NCCL_STEP S562/ | s i z e otfi(dT()t)i d{) , | n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u626p:(9g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 626 | 563 | s t epprSiimzse((tnicdc-ltSihdmSetma.rctoSmcma.tbtuefrf,S inzTehsr[eNaCdCsLS_cPaRtOtTeOr_,S INMUPLLLE,] /dNiCrCeLc_tS-T>EuPpS,/ sairzgeso-f>(sTe)n)d b{u f f| , 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ a r| g group(groups ->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::53655:: 11note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 202 | 655 | R u n W o rpkrEilmesm(etnitd<-Ftni,d STt,a rRteRdeOdpu,c eA,l gnoT,h rPeraodtsoR>e(d)u.creu,n (nwuel)l;p t r| , ^ &direct->out/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp,: 11a:r1g:s -note: >in instantiation of member function 'RunWork, 2, 2>::run' requested heres endb u11f | fI,M PaLr_gCsO-L>Lr_eFcUvNbCu(fAfl,l R e| d ^u ce, COLLNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:_202D:I53R:E Cnote: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, SIMP L202E | , M i n , f lRouantW)o r k| E^l ementn(W)ork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 
| IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:: 562note: :expanded from macro 'IMPL_COLL_FUNC'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | RunWorkr,e aNdCsC)L,_ AtLiGdOI_n#B#laolcgko(,t hNrCeCaLd_IPdRxO.TxO)_,# #gprrooutpo(>g(r)o.urpu)n,( & n| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c l S| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m em.work )563; | \ | s ^t epSize(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:h562m:e15m:. cnote: ofield 'nthreads' will be initialized after field 'tidInBlock'm m.buffS i562z | e s [ N CtCiLd_(PtRiOdT)O,_ SnItMhPrLeEa]d/sN(CnCtLh_rSeTaEdPsS)/,s itziedoIfn(BTl)o)c k{( t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| I group(groupd x.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 666field 'group' will be initialized after field 'stepSize': 9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | t i666d | ( t i d ) , n tphrims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 
| IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 202 | RunWorkElem e562n | t < F n ,t iT, RedOp, Algo, dP(rtoitdo)>,( )n.trhurne(awdes)(;n t h| r ^e ads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :g13r:o1u:p (note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup), 13| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I M P| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ COLL_FU N563C | ( A l l RsetdeupcSei,z eC(OnLcLcNlESTh_mDeImR.EcCoTm,m .SbIuMfPfLSEi,z eMsi[nN,C CrLc_cPlR_ObTfOl_oSaItM1P6L)E ] /| N^C CL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:i391z:e95o:f (note: Texpanded from macro 'IMPL_COLL_FUNC') ) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 391 group(group | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF unc##dev r626e | d o p < t y p e >p,r iNmCsC(Lt_iAdL-GtOi_d#S#taalrgtoS,c aNtCtCeLr_,P RnOTThOr_e#a#dpsrSoctaot>t(e)r.,r uNnU(L&Ln,c cdliSrhemcetm-.>wuopr,k )a;r g\s - >| s ^e ndbuff, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:r562e:c15v:b unote: ffield 'nthreads' will be initialized after field 'tidInBlock'f , | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:(53t:i dnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, nthre a202d | s ( n t h r e a dRsu)n,W otrikdEIlneBmleonctk<(Ftnh,r eTa,d IRdexd.Oxp),, Aglrgoou,p (Pgrrootuop>)(,) . r| u ^~~~~~~~~~~~~~~~~n (we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562 :| 60 ^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :56213 | : 1 : note: tin instantiation of member function 'RunWork, 2, 2>::run' requested herei d(tid) ,13 | nItMhPrLe_aCdOsL(Ln_tFhUrNeCa(dAsl)l,R etdiudcIen,B lCoOcLkL(NtEhTr_eDaIdRIEdCxT.,x )S,I MgPrLoEu,p (Mgirno,u pr)c,c l _| b ^~~~~~~~~~~f loat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkinitializer order does not match the declaration order [-Wreorder-ctor], NCCL_ALGO_##algo, N562C | C L _ P RtOiTdO(_t#i#dp)r,o tnot>h(r)e.ardusn((n&tnhcrcelaSdhsm)e,m .twiodrIkn)B;l o\c k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x) 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 |
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::687:56211::15 :note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herewarning: initializer order does not match the declaration order [-Wreorder-ctor] 687 | pr i562m | s ( t i dt-itdi(dtSitda)r,t Bnctahsrte,a dnsT(hnrtehardesaBdcsa)s,t ,t i&ddIinrBelcotc-k>(otuhtr,e anduIldlxp.txr),, agrrgosu-p>(sgernodubpu)f,f , | a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r g s| - tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)> recvbuf f563, | | ^ stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:c202l:S53h:m enote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. comm. 
b202u | f f S i z e s [ NRCuCnLW_oPrRkOETlOe_mSeInMtP{( ) .| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u n (| w group(groupe ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hin instantiation of member function 'RunWork, 2, 2>::run' requested here: 666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here13 | I MPL_COL L666_ | F U N C ( A l l Rperdiumcse(,t iCdO,L LnNTEhTr_eDaIdRsEGCaTt,h eSrI,M PdLiEr,e cMti-n>,u pr,c cNlU_LbLf,l oaartg1s6-)> s e| n^d buff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:r391g:s95-:> rnote: eexpanded from macro 'IMPL_COLL_FUNC'c vbuff, | 391 ^ | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h<:n202c:c53l:F unote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec ##fun c202, | t y p e , Func##de RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, 
direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 67 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, 
int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx940. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1101. 
67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not 
used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 562 | R u n W otrikd<(ntcicdl)F,u 
nnct#h#rfeuandcs,( nttyhpree,a dFsu)n,c #t#iddeIvnrBeldoocpk<(ttyhpree>a,d INdCxC.Lx_)A,L GgOr_o#u#pa(lggroo,u pN)C,C L _| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R O T| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ##proto >563( | ) . r u ns(t&enpcScilzSeh(mnecmc.lwSohrmke)m;. c\o m m| . ^b uffSizes[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O15T:O _note: Sfield 'nthreads' will be initialized after field 'tidInBlock'I MPLE]/NCC L562_ | S T E P St/isdi(zteiodf)(,T )n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ( n| t group(grouph reads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :g641r:o11u:p (note: gin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oup), | ^~~~~~~~~~~~~~~~~ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : note: field 'group' will be initialized after field 'stepSize'p rims(t i562d | - t i d Sttiadr(ttRiedd)uc,e ,n tnhTrheraedasd(snRtehdruecaed,s )di,r etcitd-I>ndBolwocnk,( t&hdrieraedcItd-x>.oxu)t,, garrogusp-(>gsreonudpb)u,f f ,| ^~~~~~~~~~~a rgs->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h int8:_562t:)15 : | warning: ^ initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'562 | tid(tid )391, | n tRhurneWaodrsk(g,r oNuCpC)L,_ A L| G ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O _ #| # tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a lgo, NC C563L | _ P R O TsOt_e#p#Spirzoet(on>c(c)l.Srhumne(m&.nccocmlmS.hbmuefmf.Swiozreks)[;N C\C L _| P ^R OTO_SIMPLE]/NCCL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:P562S:/15s:i znote: efield 'nthreads' will be initialized after field 'tidInBlock'o f(T)) { 562| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt id(tid), nthreads(nthreads), 
t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d677I:n11B:l onote: cin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herek (threadId x677. | x ) , g r o u p ( gprroiumps)(,t i d| - ^~~~~~~~~~~~~~~~~t idSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562t:B60c:a snote: tfield 'group' will be initialized after field 'stepSize', nThre a562d | s B c a stti,d (&tdiidr)e,c tn-t>horueta,d sd(inrtehcrte-a>ddso)w,n ,t iadrIgnsB-l>oscekn(dtbhurfefa,d Iadrxg.sx-)>,r egcrvobuupf(fg,r o u| p ^) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr'
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 
7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hlgo:,562 :N15C:C Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]P ROTO_##proto>().run(& n562c | c l S h mteimd.(wtiod), nrtkh)r;e a\d s (| n ^t hreads), 
tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562I:d15x:. xnote: )field 'nthreads' will be initialized after field 'tidInBlock', group(group), 562| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id(tid) ,563 | n t h r esatdesp(Snitzher(enacdcsl)S,h mteimd.IcnoBmlmo.cbku(ftfhSriezaedsI[dNxC.CxL)_,P RgOrToOu_pS(IgMrPoLuEp])/,N C C| L ^~~~~~~~~~~~~~~~~_ STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:S562/:s60i:z enote: ofield 'group' will be initialized after field 'stepSize'f (T)) { 562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt id(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d666s:)9:, note: tin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei dInBloc k666( | t h r e a d I d xp.rxi)m,s (gtriodu,p (ngTrhoruepa)d,s G a| t ^~~~~~~~~~~h er, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:60: note: :field 'group' will be initialized after field 'stepSize'562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~u p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hMPLE, S:u562m:,15 :u iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]t 8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t iRdu(ntWiodr)k,< nnctchlrFeuandcs#(#nftuhnrce,a dtsy)p,e, tFiudnIcn##Bdleovcrke(dtohprx,. 
xN)C,C Lg_rAoLuGpO(_g#r#oaulpg)o,, N| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C L _| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R OTO_##p r563o | t o > ( )s.treupnS(i&znec(cnlcSchlmSehmm.ewmo.rcko)m;m .\b u f| f ^S izes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Tnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'_ SIMPLE ]562/ | N C C L _tSiTdE(PtSi/ds)i,z enotfh(rTe)a)d s{( n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of 
member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp(:t5i:d1):, note: nin instantiation of member function 'RunWork, 2, 2>::run' requested heret hread s5( | nItMhPrLe_aCdOsL)L,_ FtUiNdCI(nABllloRcekd(utcher,e aCdOILdLxN.ExT)_,D IgRrEoCuTp,( gSrIoMuPpL)E,, S| u 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m , | u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i nt8_t) 563| | ^ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i391z:e95(:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Shmem.co m391m | . b uRfufnSWiozreks<[nNcCcClLF_uPnRcO#T#Of_uSnIcM,P LtEy]p/eN,C CFLu_nScT#E#PdSe/vsriezdeoopf<(tTy)p)e >{, N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| A group(groupL GO_##algo, NCCL_PROTO_##proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h>:(655):.11r:u nnote: (in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here& ncclShmem .655w | o r k ) ; \ | ^p rims(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:-562t:i15d:S tnote: afield 'nthreads' will be initialized after field 'tidInBlock'r tReduce ,562 | n T h r etaidds(Rteiddu)c,e ,n tnhurlelapdtsr(,n t&hdrieraedcst)-,> otuitd,I naBrlgosc-k>(stehnrdebaudfIfd,x .axr)g,s -g>rroeucpv(bgurfofu,p ) ,| ^ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::20260::53 :note: field 'group' will be initialized after field 'stepSize'note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t iRdu)n,WorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidSta/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:t562S:c15a:t twarning: einitializer order does not match the declaration order [-Wreorder-ctor]r , nThreadsScatter, NU L562L | , d i rteicdt(-t>iudp),, anrtghsr-e>asdesn(dnbtuhfrfe,a dasr)g,s -t>irdeIcnvBbluofcfk,( t h| r ^e adIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:)202,: 53g:r onote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herep (grou p202) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R unWorkE l563e | m e n t f(f)S.irzuens([wNeC)C;L _ P| R ^O TO_SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppL:E4]:/1N:C Cnote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested here_ STEPS /4s | iIzMePoLf_(CTO)L)L _{F U N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( A l| l group(groupR educe, COLLNET_DIREC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:,687 :S11I:M Pnote: Lin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereE , Sum, i n687t | 8 _ t ) | ^ prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(391t:i95d:- tnote: iexpanded from macro 'IMPL_COLL_FUNC'd StartBcas t391, | n TRhurneWaodrskBout, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNEIn file included from T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp_:D1I: RIn file included from E/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:T10,: In file included from S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hI:M167P: L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:,562 :S15u:m ,warning: initializer order does not match the declaration order [-Wreorder-ctor]u int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 391 : 95 : tnote: iexpanded from macro 'IMPL_COLL_FUNC'd (tid) ,391 | n t hRruenaWdosr(knr,o uNpC)C,L _ A| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~G | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: warning: initializer order does not match the declaration order [-Wreorder-ctor] :562:15 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :562 | warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nt h563r | e562 a | d tid(st(indt)h,r en atsdhtsre)ep,aS ditszi(edn(ItnnhcBrcleloaScdhksm()et,mh .rtceioadmdImIn.dBbxlu.ofxcf)kS,(i tzgherrsoe[uaNpdC(ICgdLrx_o.PuxRp)O),T, O g_ rS| oI ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~uM pP (L| gE tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r] o/uNp C)563C, | L _ S| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ E sP tS| e/ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)ps Sii zz563ee | o( fn (c Tc )ls)St he{mp eS mi| .z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ce o( mn| mc group(group.c bluSfhfmSeimz.ecso[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hmN:mC677.C:bL11u_:fP fRnote: SOin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereiT zOe_sS [I677NM | CP CL LE _] P/ RN OC TC OL __ SSpITrMEiPPmLSsE/(]st/iiNzdCe-CotLfi_(dSTST)tE)aP rS{t/ Bs ci| az ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~se to ,f| ( group(groupnT T)h)r e{a d s| B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c: a666 st, :| &9 group(groupd: i rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec t->out /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,666: | 666d :i 9r :e c note: t in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- > dp or666wi | nm ,s ( at ri gd s, - >npsTrehinrmdbuff, args-e>recvbaudfsfG,a t h| e ^r , direct->up, NULL, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g202s:-53>:s enote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered buff, a r202g | s - > r e c v b uRfufn,W o r| k ^E lement, 2, 2>::run' requested here, Proto> (202) | . r u n ( w e ) ;R u n| W ^o rkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp<:F6n:,1 :T ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereR edOp, 6A | lIgMoP,L _PCrOoLtLo_>F(U)N.Cr(uAnl(lwRee)d;u c e| , ^ COLLNET_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppD:I4R:E1C:T ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereS IMPLE ,4 | SIuMmP,L _iCnOtL3L2__FtU)N C (| A^l lReduce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391C:O95L:L Nnote: Eexpanded from macro 'IMPL_COLL_FUNC'T _DIRECT ,391 | S I MRPuLnEW,o rSku , RNuCnCWLo_rAkLe(v)r.erduonp(<&tnycpcel>S,h mNeCmC.Lw_oArLkG)O;_ #\# a l| g ^o , NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:O562_:#15#:p rnote: ofield 'nthreads' will be initialized after field 'tidInBlock't o>().ru n562( | & n c c ltSihdm(etmi.dw)o,r kn)t;h r\e a d| s ^( 
nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkup, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]60 : note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 7S | uImM,P Lu_iCnOtL8L__tF)U N C| (^A 
llReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:,391 :C95O:L Lnote: Nexpanded from macro 'IMPL_COLL_FUNC'E T_DIRECT ,391 | S I MRPuLnEW,o rSkunote: ,expanded from macro 'IMPL_COLL_FUNC' NCCL_ALGO_# #391a | l g oR,u nNWCoCrLk_c(,) .tryupne(,& nFcucnlcS#h#mdeemv.rweodrokp)<;t y\p e >| , ^ NCCL_ALGO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:#562#:a15l:g onote: ,field 'nthreads' will be initialized after field 'tidInBlock' NCCL_PR O562T | O _ # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B lock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock'( group), 562| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(60t:i dnote: )field 'group' will be initialized after field 'stepSize', nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~. 
x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o60u:p (note: gfield 'group' will be initialized after field 'stepSize'r oup), 562| | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:i562:z15e:o fwarning: (initializer order does not match the declaration order [-Wreorder-ctor]T )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t677i:d11):, note: nin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreads(nth r677e | a d s ) , t i d I npBrliomcsk((ttihdr-etaiddISdtxa.rxt)B,c agsrto,u pn(Tghrroeuapd)s,B c a| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t , | & tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d irect- >563o | u t , dsitreepcSti-z>ed(onwcnc,l Sahrmgesm-.>csoemnmd.bbuuffff,S iazregss[-N>CrCeLc_vPbRuOfTfO,_ S I| M ^P 
LE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:S202T:E53P:S /note: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei zeof(T )202) | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | R group(groupu nWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo >().run( w677e | ) ; | ^ prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpps:(7t:i1d:- tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered Start B7c | aIsMtP,L _nCTOhLrLe_aFdUsNBCc(aAsltl,R e&dduicree,c tC-O>LouLtN,E Td_iDrIeRcEtC-T>,d oSwInM,P LaEr,g sS-u>ms,e nudibnutf3f2,_ ta)r g s| -^> recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202391: | 53 : Rnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Work< n202c | c l F u n c # # fRuunncW,o rtkyEplee,m eFnutng,o ,N CPCrLo_tAoL>G(O)_.#r#uanl(gwoe,) ;N C C| L ^_ PROTO_##p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:o4t:o1>:( )note: .in instantiation of member function 'RunWork, 2, 2>::run' requested herer un(&n c4c | lIShMmem.PwLo_rCkO)L;L _\F U N| C ^( AllReduce, COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:D562I:R15E:C Tnote: ,field 'nthreads' will be initialized after field 'tidInBlock' SIMPLE, Sum, 562i | n t 8 _ tt)i d (| t^i d), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391(:n95t:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a ds), tidIn B391l | o c kR(utnhWroerakd, 562N | C C L _ AtLiGdO(_t#i#da)l,g on,t 
hNrCeCaLd_sP(RnOtThOr_e#a#dpsr)o,t ot>i(d)I.nrBulno(c&kn(ctchlrSehamdeImd.xw.oxr)k,) ;g r\o u p| ( ^g roup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 |
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), nthreads(nth r562e | a d s ) ,t 
itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I dx.x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c l S| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m em.comm .563b | u f f S iszteesp[SNiCzCeL(_nPcRcOlTSOh_mSeImM.PcLoEm]m/.NbCuCfLf_SSiTzEePsS[/NsCiCzLe_oPfR(OTT)O)_ S{I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E ] /| N group(groupC CL_STEPS/sizeof(T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~11 : | note: group(groupin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 666 :p9r:i mnote: sin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid-tid S666t | a r t R e d u c ep,r inmTsh(rteiadd,s RneTdhurceea,d snGualtlhpetrr,, d&idriercetc-t>-u>po,u tN,U LaLr,g sa-r>gsse-n>dsbeunfdfb,u fafr,g sa-r>grse-c>vrbeucfvfb,u f f| , ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::53202:: 53note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R u nRWuonrWkoErlkeEmleenmtet(o)>.(r)u.nr(uwne()w;e ) ;| ^ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp::75::11:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 75 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, SSuumm,, uuiinntt382__tt)) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::95391:: 95note: :expanded from macro 'IMPL_COLL_FUNC' note: expanded from macro 'IMPL_COLL_FUNC' 391 | R u391n | W o rRku<,t yNpCeC>L,_ ANLCGCOL__#A#LaGlOg_o#,# aNlCgCoL,_ PNRCOCTLO__P#R#OpTrOo_t#o#>p(r)o.trou>n(()&.nrcucnl(S&hnmcecml.Swhomrekm).w;o r\k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~562 :60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'group' will be initialized after field 'stepSize': 60: note: field 'group' will be initialized after field 'stepSize' 562 | 562t | i d ( t itdi)d,( tnidt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u 
p| ) ^~~~~~~~~~~, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), nthreads(nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I dx.x), g563r | o u p ( gsrtoeuppS)i,z e (| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c c l| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h mem. c563o | m m . 
b usftfeSpiSziezse[(NnCcCcLl_SPhRmOeTmO._cSoImMmP.LbEu]f/fNSCiCzLe_sS[TNECPCSL/_sPiRzOeToOf_(STI)M)P L{E ] /| N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C C L| _ group(groupS TEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~677 : 11| : group(group note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h60::562 :note: 15field 'group' will be initialized after field 'stepSize': warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~( group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:( warning: initializer order does not match the declaration order [-Wreorder-ctor] tid), nthreads(nthreads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~B lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 
655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | 
prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h stepSize(n:c562c:l15S:h mwarning: einitializer order does not match the declaration order [-Wreorder-ctor]m .comm.buffSizes[NCCL_PROTO_SIMPL 
E562] | / N C C Lt_iSdT(EtPiSd/)s,i znetohfr(eTa)d)s ({n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group) , tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h.:x666):,9 :g rnote: oin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu p(group )666, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) prims(t i563d | , n T hsrteeapdSsiGzaet(hnecrc,l Sdhimreemc.tc-o>mump.,b uNfUfLSLi,z easr[gNsC-C>Ls_ePnRdObTuOf_fS,I MaPrLgEs]-/>NrCeCcLv_bSuTfEfP,S / s| i ^z eof(T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h{: 202 :| 53 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here group(group 202 | RunWorkElement, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo , Proto>( )687. 
| r u n ( w e ) ; | p ^r ims(tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppt:i9d:S1t:a rnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereB cast, 9n | TIhMrPeLa_dCsOBLcLa_sFtU,N C&(dAilrleRcetd-u>coeu,t ,C OnLuLlNlEpTt_rD,I RaErCgTs,- >SsIeMnPdLbEu,f fS,u ma,r gusi-n>tr6e4c_vtb)u f f| ,^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202expanded from macro 'IMPL_COLL_FUNC': 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 391 | 202 | R u n W o r k < nRcucnlWFournkcE#l#efmuennct,< Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o15u:p (warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 60 : note: tfield 'group' will be initialized after field 'stepSize'i d(tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r 
etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( group), 563 | | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 
10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 
2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O15T:O _warning: #initializer order does not match the declaration order [-Wreorder-ctor]# proto>().run(&ncclShmem. 
w562o | r k ) ; t\i d (| t ^i d), nthreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i15d:I nnote: Bfield 'nthreads' will be initialized after field 'tidInBlock'l ock(threadIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562:15 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~warning: initializer order does not match the declaration order [-Wreorder-ctor] | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n666c:c9l:S hnote: min instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree m.comm. 
b666u | f f S i z e s [ NpCrCiLm_sP(RtOiTdO, n_ThreadsGather, dirSeIcMtP-L>Eu]p/,N CNCULL_LS,T EaPrSg/ss-i>zseeonfd(bTu)f)f ,{ a r| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s - >| r group(groupe cvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :202677 | : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWork E677l | e m e n t < F n , Tp,r iRmesd(Otpi,d -AtligdoS,t aPrrtoBtcoa>s(t),. rnuTnh(rweea)d;s B c| a ^s t, &dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppe:c10t:-1>:o note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ut/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, di:r562e:c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15t:-: 
>562warning: d:initializer order does not match the declaration order [-Wreorder-ctor]o15 w:n ,warning: initializer order does not match the declaration order [-Wreorder-ctor]a rg s562- | > s e n562 d | tb iu df (f t,ti idad)r(,gt sin-dt>)hr,re ecnavtdbhsur(fenfat,dh sr (e| na ^td hsr)e,a dtsi)d,I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h n:tB202il:do53Ic:nk B(note: ltin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereoh crke (a202td | hI rd ex a. dx I) d, x .gRxru)on,uW pog(rrgkorEuolpue(pmg)er,no tu e(()n.crculnS(hwmee)m;. c o| m ^m .buffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppe:s8[:N1C:C Lnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereP ROTO _8S | IIMMPPLLE_]C/ONLCLC_LF_USNTCE(PASl/lsRiezdeuocfe(,T )C)O L{L N E| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ D I| R group(groupE CT, SIMPLE, Sum, int/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h6:4641_:t11): note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :641391 | : 95 : note: expanded from macro 'IMPL_COLL_FUNC' prim s391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ e(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562391::1595:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]expanded from macro 'IMPL_COLL_FUNC' 391 | Ru n562W | o r k < ntcicdl(Ftuindc)#,# fnutnhcr,e atdysp(en,t hFruenacd#s#)d,e vtrieddIonpBt,h rNeCaCdLI_dAxL.GxO)_,# #garloguop,( gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~_ # #| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oto>(). r563u | n ( & n csctleSphSmiezme.(wnocrckl)S;h m\e m .| c ^o mm.buffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:s562[:N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'P ROTO_SI M562P | L E ] / NtCiCdL(_tSiTdE)P,S /nstihzreeoafd(sT()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I687d:x11.:x )note: ,in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here group(gro u687p | ) , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562i:m60s:( tnote: ifield 'group' will be initialized after field 'stepSize'd -tidSt a562r | t B c a stti,d (ntTihdr)e,a dnstBhcraesatd,s (&ndtihrreecatd-s>)o,u tt,i dnIunlBllpotcrk,( tahrrgesa-d>Isdexn.dxb)u,f fg,r oaurpg(sg-r>oruepc)v,b u f| f ^~~~~~~~~~~, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562:15: warning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t e p Ssitzeep(Sniczcel(SnhcmcelmS.hcmoemmm..cboumfmf.SbiuzfefsS[iNzCeCsL[_NPCRCOLT_OP_RSOITMOP_LSEI]M/PNLCEC]L/_NSCTCELP_SS/TsEiPzSe/osfi(zTe)o)f ({T ) )| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~{ | | group(group ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h655::66611::9 :note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested herenote: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655666 | | p rpirmism(st(itdi,d -ntTihdrSetaadrstGRaetdhuecre,, dniTrherceta-d>suRpe,d uNcUeL,L ,n ualrlgpst-r>,s e&nddibruefcft,- >out, args->.sewnodrbku)f;f ,\ a r| g ^s ->recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :562 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tid (202t | i d ) , n t h rReuandWso(rnktEhlreemaednst)<,F nt,i dTI,n BRleodcOkp(,t hArlegaod,I dPxr.oxt)o,> (g)r.oruupn((group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, 
nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, 
int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm.buff:S562i:z15e:s [warning: Ninitializer order does not match the declaration order [-Wreorder-ctor]C CL_PROTO_SIMPLE]/NCCL_S T562E | P S / s itziedo(ft(iTd))), {n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group( nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626g:r9o:u pnote: (in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 626| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563p | r i m s (sttiedp-StiizdeS(tnacrctlSSchamtetme.rc,o mnmT.hbruefafdSsiSzceast[ter, NULL, direct->up,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h args->:s562en:d15b:u fwarning: finitializer order does not match the declaration order [-Wreorder-ctor], args->recvbuff, | ^ 562 | 
t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202(:t53i:d), nthreads(nth renote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s), tid I202n | B l o c k ( t h rReuandWIodrxk.Exl)e,me ngtr | ( ) . r usnt(ewpeS)i;z e (| n ^c clShmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppm:m11.:b1u:f fnote: Sin instantiation of member function 'RunWork, 2, 2>::run' requested herei zes[NC C11L | _IPMRPOLT_OC_OSLILM_PFLUEN]C/(NAClClLR_eSTdEuPcSe/,s iCzOeLoLfN(ETT)_)D I{R E C| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, S| I group(groupM PLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::95655:: 11note: :expanded from macro 'IMPL_COLL_FUNC' note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | 655R | u n W o r k < n c c lpFruinmcs#(#tfiudn-ct,i dtSytpaer,t RFeudnucc#e#,d envTrherdeoapdc,e, NnCuClLl_pAtLrG,O _&#d#iarlegcot,- >NoCuCtL,_ PaRrOgTsO-_>#s#epnrdobtuof>f(,) .arrugns(-&>nrcecclvSbhumfefm,. 
w o| r ^k ); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' RunWork E562l | e m e n tt)(,) .triudnI(nwBel)o;c k (| t ^h readI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppd:x11.:x1):, note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup(g r11o | uIpM)P,L _ C| O ^~~~~~~~~~~~~~~~~L L_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hU:N562C:(60A:l lnote: Rfield 'group' will be initialized after field 'stepSize'e duce, C562O | L L N ET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); 562| | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppi:d6):,1 :n tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eads( n6t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ICdOxL.LxN)E,T _gDrIoRuEpC(Tg,r oSuIpM)P,L E ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S u m| , tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) int32_t )563 | | ^ stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e391(:n95c:c lnote: Sexpanded from macro 'IMPL_COLL_FUNC'h mem.comm .391b | u f fRSuinzWeosr[kN ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N CCL _| A group(groupL GO_##algo, NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:T666O:_9#:# pnote: rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo to>().r u666n | ( & n c c l S h mpermi.mwso(rtki)d;, \n T h| r ^e ads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hG:a562t:h15e:r ,note: field 'nthreads' will be initialized after field 'tidInBlock'd irect- >562u | p , N UtLiLd,( tairdg)s,- >nstehnrdebaudfs(nthreads), tidInBlockf(,t harregasd-I>drxe.cxv)b,u fgfr,o u p| ( ^g roup), | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h53::562 :note: 60in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: note: field 'group' will be initialized after field 'stepSize' 202 | 562 | tRiudn(Wtoirdk)E,l enmtehnrtet(h)r.eraudnI(dwxe.)x;) , | g ^r oup(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppu:p11):,1 : | note: ^~~~~~~~~~~in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562p:S15i:z ewarning: (initializer order does not match the declaration order [-Wreorder-ctor]n cclShmem.comm.bu f562f | S i z e st[iNdC(CtLi_dP)R,O TnOt_hSrIeMaPdLsE(]n/tNhCrCeLa_dSsT)E,P St/isdiIzneBolfo(cTk)()t h{r e a| d 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I d x| . group(groupx ), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 655 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 11 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 655 | s t e p S i z e (pnrcicmlsS(htmiedm-.tciodmSmt.abrutfRfeSdiuzcees,[ NnCTChLr_ePaRdOsTROe_dSuIcMeP,L En]u/lNlCpCtLr_, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hem.wor:k562):;15 :\ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock't id(tid), nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.had:ds562I):d,15x :.t xiwarning: )dinitializer order does not match the declaration order [-Wreorder-ctor],I ngBrlooucpk ((562gt | rh or ue pa )dt,Ii dd x(| .t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~xi )d ,)| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g rnot uh563pr | (e ga rd os u(spnt)te,hp rS ei| az ^~~~~~~~~~~~~~~~~de s()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn,:c 562ct:li60Sd:hI mnnote: eBfield 'group' will be initialized after field 'stepSize'ml .occ ok562m( | mt .h br ue fatfdiSIdid(zxte.isxd[))N,,C CgnLrt_ohPurRpeO(aTgdOrs_o(SunIptM)hP,rL eE a]| d/ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~sN 
)C ,C| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t_ iSd TI563En | PB Sl /o sc iksz(teteohpfrS(eiTaz)de)I( dn{xc .c xl| )S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~,h m e| m group(group. cgormomu.pb(ugfr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hfo:Su666ip:z)9e,:s [ note: N| in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC ^~~~~~~~~~~ C L _666P | R O T O _ S I M PpLrEi]m/sN(CtCiLd_,S TnETPhSr/esaidzseGoaft(hTe)r), {d i r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c t -| > group(groupu p, NULL, args->sendbuff, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e687c:v11/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ buf:f, | ^ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 202 : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here prims(tid- t202i | d S t a r t B c aRsutn,W onrTkhErleeamdesnBtc oAultg, on,u lPlrpottro,> (a)r.grsu-n>(sween)d;b u f| f ^, args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppe:c11v:b1u:f fnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^ 11 | IMPL_COLL_FUNC(AllReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:O202L:L53N:E Tnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereD IRECT, S202I | M P L E , S u mR,u nfWlooraktE)l e m| e^n t | ( ) .RruunnW(owrek)<;n c c| l ^F unc##func,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :t11y:p1e:, note: Fin instantiation of member function 'RunWork, 2, 2>::run' requested hereu nc##d e11v | rIeMdPoLp_F,U NNCC(CALl_lARLeGdOu_c#e#,a lCgOoL,L NNECTC_LD_IPRREOCTTO,_ #S#IpMrPoLtEo,> (S)u.mr,u nf(l&onactc)l S h| m^e m.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :\391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :391562 | : 15 :R unote: nfield 'nthreads' will be initialized after field 'tidInBlock'W orkt,i dNICnCBLl_ock(threadIdx.x), group(group), A | L ^~~~~~~~~~~~~~~~~G O_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:l562g:o60,: Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_PROT O562_ | # # p r ottiod>((t)i.dr)u,n 
(n&tnhcrcelaSdhmem.work); \ | ^ s(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cnote: kfield 'nthreads' will be initialized after field 'tidInBlock'( threadIdx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~e ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, 
double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO _562# | # a l g ot,i dN(CtCiLd_), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562g:s15-:> rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]c vbuff, | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :t53i:d (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d), nt h202r | e a d s ( n t h rReuandWso)r,k EtliedmIennBtlo(u)p.)r,u n (| w ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e ) ;| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 12 :s1t:e pnote: Sin instantiation of member function 'RunWork, 2, 2>::run' requested herei ze(nc c12l | SIhMmPeLm_.CcOoLmLm_.FbUuNfCf(SAilzleRse[dNuCcCeL,_ PCROOLTLON_ESTI_MDPILREE]C/TN,C CSLI_MSPTLEEP,S /Ssuimz,e odfo(uTb)l)e ){ | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: 391note: | in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWork <655n | c c l F u n c # # f upnrci,m st(ytpied,- tFiudnSct#a#rdteRverdeudcoep,< tnyTpher>e,a dNsCRCeLd_uAcLeG,O 
_n#u#lallpgtor,, N&CdCiLr_ePcRtO-T>Oo_u#t#,p raorto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ gs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k ( t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eadIdx. x563) | , g r osutpe(pgSriozuep()n,c c l| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h m e| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). comm.bu f563f | S i z e ss[tNeCpCSLi_zPeR(OnTcOc_lSSIhMmPeLmE.]c/oNmCmC.Lb_uSfTfESPiSz/essi[zNeCoCfL(_TP)R)O T{O _ S| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M P L| E group(group] /NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~666 : 9| : group(group note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n626T:h9r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres Gather, d i626r | e c t - > u p , pNrUiLmLs,( tairdg-st-i>dsSetnadrbtuSfcfa,t taerrg,s -n>TrhercevabdusfSfc,a t t| e ^r , NULL, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202c:t53-:> unote: 
pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, args- >202s | e n d b u f f , RaurngWso-r>krEelcevmbeunftf<,F n ,| ^T , RedOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :A202l:g53o:, note: Pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer oto>( )202. | r u n ( w e ) ; R u| n ^W orkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppt:<7F:n1,: Tnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 RedOp, A | lgo, IPMrPoLt_oC>O(L)L._rFuUnN(Cw(eA)l;l R e| d ^u ce, COLLNET_DIRECT, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppI:M12P:L1E:, note: Sin instantiation of member function 'RunWork, 2, 2>::run' requested hereu m, uint 3122 | _ItM)P L _| C^O LL_FUNC(All/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:e391d:u95c:e ,note: expanded from macro 'IMPL_COLL_FUNC'C OLLNET_DIR E391C | T , RSuInMWPoLrEk,< nScucml,F udnocu#b#lfeu)n c ,| ^t ype, Fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:#391#:d95e:v rnote: eexpanded from macro 'IMPL_COLL_FUNC'd op, 391N | C C LR_uAnLWGoOr_k#<#naclcgloF,u nNcC#C#Lf_uPnRcO,T Ot_y#p#ep,r oFtuon>c(#)#.dreuvnr(e&dnocpcm,. wNoCrCkL)_;A L\G O _| # ^# algo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:R562O:T15O:_ #note: #field 'nthreads' will be initialized after field 'tidInBlock'p roto>(). r562u | n ( & n ctcildS(htmiedm).,w onrtkh)r;e a\d s (| n ^t hreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562I:n15B:l onote: cfield 'nthreads' will be initialized after field 'tidInBlock'k (threadI d562x | . 
x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s (nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s )note: ,field 'group' will be initialized after field 'stepSize' tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o:c knote: (field 'group' will be initialized after field 'stepSize't hreadI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~s (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' 
requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : group(group15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677 :t11i:d (note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d), nthr e677a | d s ( n t h r e a d sp)r,i mtsi(dtIindB-ltoicdkS(ttahrrteBacdaIsdtx,. xn)T,h rgeraoduspB(cgarsotu,p )&,d i r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c t -| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o ut, dir e563c | t - > d oswtne,p Sairzges(-n>cscelnSdhbmuefmf.,c oamrmg.sb-u>frfeScivzbeusf[fN,C C L| _ ^P ROTO_SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:E202]:/53N:C Cnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ STEPS/ s202i | z e o f ( T ) ) R{u n W| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r k E| l group(groupe ment, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, Proto>() .641r | u n ( w e ) ; | ^p rims(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppd:-12t:i1d:S tnote: ain instantiation of member function 'RunWork, 2, 2>::run' requested herer tRedu c12e | ,I MnPTLh_rCeOaLdLs_RFeUdNuCc(eA,l ldRierdeuccte-,> dCoOwLnL,N E&Td_iDrIeRcEtC-T>,o uStI,M PaLrEg,s -S>usme,n ddbouufbfl,e )a r g| s^- >recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202: 
53391: | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR unWork <202n | c c l F u n c # #RfuunnWco,r ktEylpeem,e nFtuo,, NPCrCoLt_oA>L(G)O._r#u#na(lwgeo),; N C| C ^L _PROTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp#:p12r:o1t:o >note: (in instantiation of member function 'RunWork, 2, 2>::run' requested here) .run( &12n | cIcMlPSLh_mCeOmL.Lw_oFrUkN)C;( A\l l R| e ^d uce, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:N562E:T15_:D Inote: Rfield 'nthreads' will be initialized after field 'tidInBlock'E CT, SIM P562L | E , S utmi,d (dtoiudb)l,e )n t h| r^e ads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:s )note: ,expanded from macro 'IMPL_COLL_FUNC' tidInBloc k391( | t h rReuandWIodrxk. , N CtCiLd_(AtLiGdO)_,# #natlhgroe,a dNsC(CnLt_hPrReOaTdOs_)#,# ptriodtIon>B(l)o.crku(nt(h&rnecacdlISdhxm.exm).,w ogrrko)u;p (\g r o| u ^p ), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 
2>::run' requested here a r7t | RIeMdPuLc_eC,O LnLT_hFrUeNaCd(sARleldRuecdeu,c ed,i rCeOcLtL-N>EdTo_wDnI,R E&CdTi,r eScItM-P>LoEu,t ,S uamr,g su-i>nste3n2d_btu)f f ,| ^a rgs->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:c391v:b95u:f fnote: ,expanded from macro 'IMPL_COLL_FUNC' | ^ 391 | RunWork, 2, 2>::run' requested here, type, Fu n202c | # # d e v r e d oRpuE,l eNmCeCnLt_().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, Algo:,562 :P15r:o twarning: oinitializer order does not match the declaration order [-Wreorder-ctor]> ().run(we); | ^ 562 | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpph:r12e:a1d:s )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here tidInBloc k12( | tIhMrPeLa_dCIOdLxL._xF)U,N Cg(rAolulpR(egdruocuep,) ,C O L| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N E T| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)D IRECT, 563S | I M P L Es,t eSpuSmi,z ed(onucbclleS)h m e| m^. 
comm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:S391i:z95e:s [note: Nexpanded from macro 'IMPL_COLL_FUNC'C CL_PROTO_ S391I | M P LREu]n/WNoCrCkL<_nScTcElPFSu/nsci#z#efoufn(cT,) )t y{p e ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F u n| c group(group# #devredop, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:A687L:G11O:_ #note: #in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea lgo, NC C687L | _ P R O T O _ # # p prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:c562e:,15 :n uwarning: linitializer order does not match the declaration order [-Wreorder-ctor]l ptr, &direct->out, a r562g | s - > s etniddb(utfifd,) ,a rngtsh-r>eraedcsv(bnutfhfr,e a d| s ^) , tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k202(:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered Idx.x )202, | g r o u p ( g rRouunpW)o,r k E| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e m e| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t S(h)m.ermu.nc(owmem).;b u f| f ^S izes[NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppP:R12O:T1O:_ Snote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hereM PLE]/ N12C | CILM_PSLT_ECPOSL/Ls_iFzUeNoCf((ATl)l)R e{d u c| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, C| O group(groupL LNET_DIRECT, SIMPLE, Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:,687 :d11o:u bnote: lin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ) | ^ 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' prims(tid -391t | i d SRtuanrWtoBrcka#oduetv,r enduolpla,r gNsC-C>Ls_eAnLdGbOu_f#f#,a lagrog,s 
-N>CrCeLc_vPbRuOfTfO,_ # #| p ^r oto>().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:&202n:c53c:l Snote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem em.wor k202) | ; \ | ^ RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:lement((t)i.dr)u,n (nwteh)r;e a d| s ^( nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 7t:i1d:I nnote: Bin instantiation of member function 'RunWork, 2, 2>::run' requested herel ock(t h7r | eIaMdPILd_xC.OxL)L,_ FgUrNoCu(pA(lglrRoeudpu)c,e , | C ^~~~~~~~~~~~~~~~~O LLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:T562_:D60I:R Enote: Cfield 'group' will be initialized after field 'stepSize'T , SIMP L562E | , S u mt,i du(itnitd3)2,_ tn)t h r| e^a ds(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95):, note: texpanded from macro 'IMPL_COLL_FUNC'i dInBlock( t391h | r e aRduIndWxo.rxk)<,n cgcrloFuupn(cg#r#ofuupn)c,, t| y ^~~~~~~~~~~p e, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(15&:n cwarning: cinitializer order does not match the declaration order [-Wreorder-ctor]l Shmem.work); \ 562| | ^ tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd s(nthrea d562s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( 
gtrioduIpnBlock(thre)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, : 562| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~15 : | warning: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)initializer order does not match the declaration order [-Wreorder-ctor] 563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INdCxC.Lx_)S,T EgPrSo/uspi(zgeroofu(pT)),) {| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | group(group 563 | stepSize(nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:l677S:h11m:e mnote: .in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec omm.buff S677i | z e s [ N C C L _ P RpOrTiOm_sS(ItMiPdL-Et]i/dNSCtCaLr_tSBTcEaPsSt/,s inzTehorfe(aTd)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)60,: tnote: ifield 'group' will be initialized after field 'stepSize'd InBlock(thread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads), t563i | d I n B lsotcekp(Stihzree(andcIcdlxS.hxm)e,m .gcroomump.(bgurfofuSpi)z,e s [| N ^~~~~~~~~~~C CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), |
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:E562]:/N15C:C Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]S TEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 | | group(group tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e687a:d11s:( nnote: tin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads), t687i | d I n B l o c k ( t hprreiamdsI(dtxi.dx-)t,i dgSrtoaurpt(Bgcraosutp,) ,n T h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B cast, & d563i | r e c t -s>toeuptS,i zneu(lnlcpctlrS,h maermg.sc-o>msme.nbdubfuffSfi,z easr[gNsC-C>Lr_ePcRvObTuOf_fS,I M P| L ^E 
]/NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:E202P:S53/:s inote: zin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree of(T) )202 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupR unWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg o, Prot o666> | ( ) . r u n ( w ep)r;i m s| ( ^t id, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppa:d13s:G1a:t hnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herer , dire c13t | -I>MuPpL,_ CNOULLLL_,F UaNrCg(sA-l>lsReenddubcuef,f ,C OaLrLgNsE-T>_rDeIcRvEbCuTf,f ,S I M| P ^L E, Sum, rc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:l202_:b53f:l onote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret 16) | 202^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :R95u:n Wnote: oexpanded from macro 'IMPL_COLL_FUNC'r kElement <391F | n , RTu,n WRoerdkO,( )t.yrpuen,( wFeu)n;c # #| d ^e vredop:,1 :N Cnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _ALGO _13# | #IaMlPgLo_,C ONLCLC_LF_UPNRCO(TAOl_l#R#epdruocteo,> (C)O.LrLuNnE(T&_nDcIcRlESChTm,e mS.IwMoPrLkE),; S\u m ,| rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: 
warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- 
--offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set 
but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set 
but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: 
variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: 
unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | 
prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' 
requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, 
nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sIednxd.bxu)f,f ,g raorugps(-g>rroeucpv)b,u f f| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :56353 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here stepS i202z | e ( n c c l S h mReumn.WcoormkmE.lbeumfefnStiL(_)S.TrEuPnS(/wsei)z;e o f| ( ^T )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp| : group(group5 :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:A666l:l9R:e dnote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec e, COLL N666E | T _ D I R E C T ,p rSiImMsP(LtEi,d ,P rneTMhurleSaudms,G autihnetr8,_ td)i r e| c^t ->up, NU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:L391,: 95a:r gnote: sexpanded from macro 'IMPL_COLL_FUNC'- >sendbuf f391, | a rRgusn-W>orrekc, 2, 2>::run' requested here# #devr e202d | o p < t y p e > ,R uNnWorkElementP(R)O.TrOu_n#(#wper)o;t o >| ( ^) .run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppc:c4l:S1h:m enote: min instantiation of member function 'RunWork, 2, 2>::run' requested here. 
work) ;4 | \I M P| L ^_ COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:A562l:l15R:e dnote: ufield 'nthreads' will be initialized after field 'tidInBlock'c e, COL L562N | E T _ D ItRiEdC(Tt,i dS)I,M PnLtEh,r ePardesM(unltShurme,ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' 
requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement:(562):.15r:u nwarning: (initializer order does not match the declaration order [-Wreorder-ctor]w e); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1 :562 | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid (5t | iIdM)P,L _nCtOhLrLe_aFdUsN(Cn(tAhlrleRaeddsu)c,e ,t iCdOILnLBNlEoTc_kD(ItRhErCeTa,d ISdIxM.PxL)E,, gPrroeuMpu(lgSruomu,p )u,i n t| 8 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ t )| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :s95t:e pnote: Sexpanded from macro 'IMPL_COLL_FUNC'i ze(ncclSh m391e | m . cRoumnmW.obrukff,( TN)C)C L{_ A L| G ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O _ #| # group(groupa lgo, NCCL_PROTO_##proto>().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:&655n:c11c:l Snote: hin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem em.work); 655\ | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562i:m15s:( tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd -tidSt a562r | t R e d utcied,( tniTdh)r,e andtshRreedaudcse(,n tnhurlelapdtsr),, &tdiidrIencBtl-o>coku(tt,h raeragdsI-d>xs.exn)d,b ugfrfo,u pa(rggrso-u>pr)e,c v b| u ^~~~~~~~~~~~~~~~~f f, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 562note: | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(All/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, 
COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we)_; | ^ DIREC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppT:,5 :S1I:M Pnote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested hereE , PreM u5l | SIuMmP,L _iCnOtL3L2__FtU)N C (| A^l lReduce, COLLNET_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:R391E:C95T:, note: Sexpanded from macro 'IMPL_COLL_FUNC'I MPLE, PreMulSum ,391 | u i nRtu8n_Wto)r k <| n^c clFunc#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:f391u:n95c:, note: texpanded from macro 'IMPL_COLL_FUNC'y pe, Func #391# | d e vRruendWooprl,F uNnCcC#L#_fAuLnGcO,_ #t#yapleg,o ,F uNnCcC#L#_dPeRvOrTeOd_o#p#>,( )N.CrCuLn_(A&LnGcOc_l#S#hamlegmo.,w oNrCkC)L;_ P\R O T| O ^_ ##proto>().run(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:c562c:l15S:h mnote: efield 'nthreads' will be initialized after field 'tidInBlock'm .work); \562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n tnote: hfield 'nthreads' will be initialized after 
field 'tidInBlock'r eads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~a dId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)60,: gnote: rfield 'group' will be initialized after field 'stepSize'o up(gro u562p | ) , | t ^~~~~~~~~~~~~~~~~i d(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,60 :n tnote: hfield 'group' will be initialized after field 'stepSize'r eads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~a dIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hPL:E562]:/15N:C Cwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]_ STEPS/sizeof(T)) { | 562 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:t687h:r11e:a dnote: sin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , tidIn B687l | o c k ( t h r e a d Ipdrxi.mxs)(,t igdr-otuipd(SgtraorutpB)c,a s t| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n T| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eadsBca s563t | , & d isrteecptS-i>zoeu(tn,c cnluSlhlmpetmr.,c oamrmg.sb-u>fsfeSnidzbeusf[fN,C CaLr_gPsR-O>TrOe_cSvIbMuPfLfE,] / N| C ^C L_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:i202z:e53o:f (note: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here )) { 202| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Algo, Pr o655t | o > ( ) . 
r u n ( w ep)r;i m s| ( ^t id-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppa:r5t:R1e:d unote: cin instantiation of member function 'RunWork, 2, 2>::run' requested heree , nTh r5e | aIdMsPRLe_dCuOcLeL,_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 5: | 562I:M15P:L _warning: Cinitializer order does not match the declaration order [-Wreorder-ctor]O LL_FUNC(AllReduce, C O562L | L N E T 
_tDiIdR(EtCiTd,) ,S InMtPhLrEe,a dPsr(enMtuhlrSeuam,d su)i,n tt8i_dtI)n B l| o^c k(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x391.:x95):, note: gexpanded from macro 'IMPL_COLL_FUNC'r oup(group )391, | | R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n W| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r kz,e sN[CNCCLC_LA_LPGROO_T#O#_aSlIgMoP,L EN]C/CNLC_CPLR_OSTTOE_P#S#/psriozteoo>f(()T.)r)u n{( & n| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c l S| h group(groupm em.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 626 | 562 | tipdr(itmisd()t,i dn-tthirdeSatdasr(tnStchartetaedrs,) ,n TthirdeIandBslSoccakt(ttehrr,e aNdUILdLx,. 
xd)i,r egcrto-u>pu(pg,r oaurpg)s->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, 
nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: 
note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rgs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, 
uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b15u:f fwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 562note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ti d202( | t i d ) , n t hRruenaWdosr(knEtlhermeeandts<)F,n ,t iTd,I nRBeldoO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp, Algo:, 562P:r15o:t owarning: >initializer order does not match the declaration order [-Wreorder-ctor]( 
).run(we); | ^ 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppn:t7h:r1e:a dnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here) , tidInB l7o | cIkM(PtLh_rCeOaLdLI_dFxU.NxC)(,A lglrRoeudpu(cger,o uCpO)L,L N E| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ D I| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)E CT, SI M563P | L E , PsrteeMpuSliSzuem(,n cucilnSth3m2e_mt.)c o m| m^. buffSizes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C391L:_95P:R Onote: Texpanded from macro 'IMPL_COLL_FUNC'O _SIMPLE]/ N391C | C L _RSuTnEWPoSr/ks:,687 :N11C:C Lnote: _in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereA LGO_##al g687o | , N C C L _ P R O TpOr_i#m#sp(rtoitdo->t(i)d.Srtuanr(t&BnccacsltS,h mneTmh.rweoardks)B;c a\s t ,| ^& direct->out, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:u562l:l15p:t rnote: ,field 'nthreads' will be initialized after field 'tidInBlock' args->sen d562b | u f f , tairdg(st-i>dr)e,c vnbtuhfrfe,a d s| ( ^n threads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202t:i53d:I nnote: Bin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel ock(t h202r | e a d I d x . x )R,u ngWroorukpE(lgermoeunpt)<,F n ,| ^~~~~~~~~~~~~~~~~T , Re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:,60 :A lnote: gfield 'group' will be initialized after field 'stepSize'o , Prot o562> | ( ) . 
r utni(dw(et)i;d ) ,| ^n threads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppn:t8h:r1e:a dnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here) , tidI n8B | lIoMcPkL(_tChOrLeLa_dFIUdNxC(.Axl)l,R egdruocuep,( gCrOoLuLpN)E,T _ D| I ^~~~~~~~~~~R ECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, 
&direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: cwarning: initializer order does not match the declaration order [-Wreorder-ctor] k(threadIdx.x), group(grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id), nthreads(nthreads )563, | t i d IsntBelpoScikz(et(hnrcecaldSIhdmxe.mx.)c,o mgmr.obuupf(fgSriozueps)[,N C C| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ P R| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T O_SIMPL 
E563] | / N C C Ls_tSeTpESPiSz/es(inzcecolfS(hTm)e)m .{c o m| m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. b u| f group(groupf Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:o641f:(11T:) )note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 641 | prims(tid-tidStartReduce, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:h677r:e11a:d snote: Rin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree duce, dir e677c | t - > d o w n , & dpirriemcst(-t>iodu-tt,i daSrtgasr-t>Bsceansdtb,u fnfT,h raeragdss-B>craesctv,b u&fdfi,r e c| t ^- >out, direct-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:d202o:w53n:, note: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer gs->se n202d | buff, args->recvbuff, | ^ RunWorkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:m202e:n53t:< Fnote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, T, Re d202O | p , A l g o , RPurnoWtoor>k(E)l.ermuenn(tw (note: )in instantiation of member function 'RunWork, 2, 2>::run' requested here. 
run(we )7; | I M| P ^L _COLL_FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppl:l8R:e1d:u cnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested here, COLL N8E | TI_MDPILRECT, SIMP_LCEO,L LP_rFeUMNuCl(SAulml,R eudiunt32_t) | c^e , COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:D391I:R95E:C Tnote: ,expanded from macro 'IMPL_COLL_FUNC' SIMPLE, P r391e | M u lRSuunmW,o rikn , RNuCnCWLo_rAkLe(v)r.erduonp(<&tnycpcel>S,h mNeCmC.Lw_oArLkG)O;_ #\# a l| g ^o , NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#15p:r onote: tfield 'nthreads' will be initialized after field 'tidInBlock'o >().run( &562n | c c l S htmiedm(.twiodr)k,) ;n t\h r e| a ^d s(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i15d:I nnote: Bfield 'nthreads' will be initialized after field 'tidInBlock'l ock(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562):,60 :t inote: dfield 'group' will be initialized after field 'stepSize'I nBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 
2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(t ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562641::1511:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 641 | 562 | p r itmisd((ttiidd-)t,i dnSttharretaRdesd(uncteh,r enaTdhsr)e,a dtsiRdeIdnuBcleo,c kd(itrherceta-d>Iddoxw.nx,) ,& dgirroeucpt(-g>roouutp,) ,a r g| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~- > s| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n dbuff, 563a | r g s - >srteecpvSbiuzfef(,n c c| l ^S hmem.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f202f:S53i:z enote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here[ NCCL_ P202R | O T O _ S I M P LREu]n/WNoCrCkLE_lSeTmEePnSt/().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here655 | 9 | I M P Lp_rCiOmLsL(_tFiUdN-Ct(iAdlSltRaerdtuRceed,u cCeO,L LnNTEhTr_eDaIdRsERCeTd,u cSeI,M PnLuEl,l pPtrre,M u&ldSiurme,c tu-i>notu6t4,_ ta)r g s| -^> sendbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a391r:g95s:- >note: rexpanded from macro 'IMPL_COLL_FUNC'e /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :391562 | : 15 :R uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]W orkh,r eNaCdCsL(_nAtLhGrOe_a#d#sa)l,g ot,i dNICnCBLl_oPcRkO(TtOh_r#e#apdrIodtxo.>x()),. rgurno(u&pn(cgcrloSuhpm)e,m . 
w| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r k )| ; tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) \ | ^ 563 | stepSize(ncclShmem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:m562m:.15b:u fnote: ffield 'nthreads' will be initialized after field 'tidInBlock'S izes[NCCL_PR O562T | O _ S I MtPiLdE(]t/iNdC)C,L _nStThErPeSa/dssi(znetohfr(eTa)d)s ){, t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I n| B group(groupl ock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 687 :| 11 ^~~~~~~~~~~~~~~~~: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: field 'group' will be initialized after field 'stepSize' 687 | 562 | tpirdi(mtsi(dt)i,d -nttihdrSetaadrst(Bnctahsrte,a dnsT)h,r etaiddsIBncBalsotc,k (&tdhirreeacdtI-d>xo.uxt),, ngurloluppt(rg,r oaurpg)s,- > s| e ^~~~~~~~~~~n dbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStalrSthRmeedmu.cceo,m mn.TbhurfefaSdiszReesd[uNcCeC,L _nPuRlOlTpOt_rS,I M&PdLiEr]e/cNtC-C>Lo_uStT,E PaSr/gssi-z>esoefn(dTb)u)f f,{ a r| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s - | > group(groupr ecvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655 : 11 : note: Rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested hereu nWorkElementd(S)t.arrtuRne(dwuec)e;, n| T ^h readsReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppe:,8 :n1u:l lnote: pin instantiation of member function 'RunWork, 2, 2>::run' requested heret r, &di r8e | cItM-P>Lo_uCtO,L La_rFgUsN-C>(sAelnldRbeufdfu,c ea,r gCsO-L>LrNeEcTv_bDuIRfEfC,T , | S ^I MPLE, Pre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:u202l:S53u:m ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei nt64_t )202 | | ^ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:W391o:r95k:E lnote: eexpanded from macro 'IMPL_COLL_FUNC'm ent#(#)f.urnucn,( wtey)p;e , | F ^u nc##devre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:o9p:<1t:y note: pin instantiation of member function 'RunWork, 2, 2>::run' requested heree >, NCC L9_ | AILMGPOL__#C#OaLlLg_oF,U NNCC(CALl_lPRReOTdOu_c#e#,p rCoOtLoL>N(E)T._rDuInR(E&CnTc,c lSSIhMmPeLmE.,w oPrrke)M;u l\S u | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: mnote: ,field 'nthreads' will be initialized after field 'tidInBlock' uint64_ t)562 | | ^ tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391n:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd s(nthread s391) | , tRiundWIonrBkl60,: Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_ALGO _#562# | a l g o ,t iNdC(CtLi_dP)R,O TnOt_h#r#epardost(on>t(h)r.eraund(s&)n,c ctliSdhImneBml.owcokr(kt)h;r e\a d I| d ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->x.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562z:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] eof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s677):,11 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI nBlock(threadI d677x | . 
x ) , g r o u p (pgrriomusp()t,i d -| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d S| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rtBcast ,563 | n T h r esatdespBSciazset(,n c&cdliSrhemcetm-.>cooumtm,. bduifrfeScitz-e>sd[oNwCnC,L _aPrRgOsT-O>_sSeInMdPbLuEf]f/,N CaCrLg_sS-T>ErPeSc/vsbiuzfefo,f ( T| ) ^) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:m655e:n11t:, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here T, RedOp, A655l | g o , P r o t o > (p)r.irmusn((twied)-;t i d| S ^t artReduce, nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpph:r8e:a1d:s Rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested hered uce, n8u | lIlMpPtLr_,C O&LdLi_rFeUcNtC-(>AolultR,e daurcges,- >CsOeLnLdNbEuTf_fD,I RaErCgTs,- >SrIeMcPvLbEu,f fP,r e M| u ^l Sum, int64_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 202 :| 53^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391202: | 95 : note: expanded from macro 'IMPL_COLL_FUNC' RunWo r391k | E l eRmuennWtou(n)c.#r#udne(vwree)d;o p <| t ^y pe>, NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppL:G9O:_1#:# anote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereg o, NC C9L_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15::562 :note: 15field 'nthreads' will be initialized after field 'tidInBlock': warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l 
ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g60r:o unote: pfield 'group' will be initialized after field 'stepSize') , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid( t563i | d ) , nsttherpeSaidzse((nntchcrleSahdmse)m,. ctoimdmI.nbBulfofcSki(ztehsr[eNaCdCILd_xP.RxO)T,O _gSrIoMuPpL(Eg]r/oNuCpC)L,_ S T| E ^~~~~~~~~~~P S/&sizeof(T)) d{i r e| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t - >| o group(groupu t, nullptr, args->sendbuff, args->recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677| : ^11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:i202m:s53(:t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested here- tidStartBc a202s | t , n T h r e aRdusnBWcoarsktE,l e&mdeinrte oTu,t ,R eddiOrpe,c tA-l>gdoo,w nP,r oatrog>s(-)>.sreunnd(bwuef)f;, a| r ^g s->recvbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppf:f10,: 1 :| ^note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:L202_:C53O:L Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereF UNC(Al l202R | e d u c e , C ORLuLnNWEoTr_kDEIlReEmCeTn,t | (^) .run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 391 :| 95 ^: note: expanded from macro 'IMPL_COLL_FUNC' 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp: 9391: | 1 : Rnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren Work< n9c | cIlMFPuLn_cC#O#LfLu_nFcU,N Ct(yAplel,R eFduuncce#,# dCeOvLrLeNdEoTp_T,, NSCICMLP_LAEL,G OP_r#e#MaullgSou,m ,N CuCiLn_tP6R4O_TtO)_ # #| p^r oto>().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:n391c:c95l:S hnote: mexpanded from macro 'IMPL_COLL_FUNC'e m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.
buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBl o10c | k(tIhMrPeLa_dCIOdLxL._xF)U,N Cg(rAolulpR(egdruocuep,) ,C O L| L ^~~~~~~~~~~~~~~~~N ET_DIRECT, SIMPLE, PreMulSum/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562h:a60l:f )note: field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nthreads( n391t | h r eRaudnsW)o,r kt, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562I:n15B:l owarning: cinitializer order does not match the declaration order [-Wreorder-ctor]k (threadIdx.x), grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n thread s563( | n t h r esatdesp)S,i ztei(dnIcncBllSohcmke(mt.hcroemamd.Ibduxf.fxS)i,z egsr[oNuCpC(Lg_rPoRuOpT)O,_ S I| M ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P L E| ] tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/ NCCL_S T563E | P S / s iszteeopfS(iTz)e)( n{c c l| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h m e| m group(group. comm.buffSizes[NCCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:S687I:M11P:L Enote: ]in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/ NCCL_STE P687S | / s i z e o f ( T ) )p r{i m s| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| - group(groupt idStartBcast, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d655s:B11c:a snote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, &direct -655> | o u t , n u l l p tprr,i masr(gtsi-d>-steinddSbtuafrft,R educe,args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of 
member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreadps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBlock(threadIdx.x )563, | g r o uspt(egprSoiuzpe)(,n c | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hthreads(nthrea:d562s:)15,: twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d InBlock(threadIdx.x), group(group), | 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(tid )563, | n t h rsetaedpsS(inzteh(rnecacdlsS)h,m etmi.dcIonmBml.obcukf(ftShirzeeasd[INdCxC.Lx_)P,R OgTrOo_uSpI(MgPrLoEu]p/)N,C C L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S TE P| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/ sizeof (563T | ) ) { s t| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p S i| z group(groupe (ncclShmem.comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h[:NC641C:L11_:P Rnote: Oin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT O_SIMPLE ]641/ | N C C L _ S T E P S /psriizmeso(ft(iTd)-)t i{d S t| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r t R| e group(groupd uce, nThreadsReduce, direct->down, &direct->o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:t626,: 9a:r gnote: sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >sendbuf f626, | a r g s - > r epcrvibmusf(ft,i d -| t ^i 
dStartScat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:e202r:,53 :n Tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eadsS c202a | t t e r , N U LRLu,n WdoirrkeEclte-m>eunpt,< Fanr,g sT-,> sReenddObpu,f fA,l gaor,g sP-r>orteoc>v(b)u.frfu,n ( w| e ^) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp::5312:: 1note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 202 | 12 | I M P L _ CROuLnLW_oFrUkNECl(eAmlelnRteL(E),.run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655t:i11d:( tnote: iin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nthre a655d | s ( n t h r e a d s )p,r itmisd(ItniBdl-otcikd(SttharretaRdeIdduxc.ex,) ,n Tghrroeuapd(sgRreoduupc)e,, n| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l l p| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r , &dir e563c | t - > o ustt,e paSrigzse-(>nscecnldSbhumfefm,. 
caormgms.-b>urfefcSvibzuefsf[,N C C| L ^_ PROTO_SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:]202/:N53C:C Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS TEPS/ s202i | z e o f ( T ) ) R{u n W| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r k E| l group(groupe ment11(:) .note: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n(we); | 655 ^ | p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppr:i10m:s1(:t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here- tidSt a10r | tIRMePdLu_cCeO,L Ln_TFhUrNeCa(dAsRedlulcRee,d uncuel,l pCtOrL,L N&EdTi_rDeIcRtE-C>To,u tS,I MaPrLgEs,- >PsreenMdubluSfufm,, ahraglsf-)> r e| c^v buff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 391in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, 
nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, 
double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member 
function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx1101. 
67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -MF 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx906. 
8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | 
Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -c 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]
  271 |         uint64_t* ptr = recvPtr(0)+ll128Offset;
      |                   ^~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable]
  386 |     int wireOffset = WireWordPerSlice*warp + 2*wid;
      |         ^
7 warnings generated when compiling for gfx1100.
7 warnings generated when compiling for gfx803.
7 warnings generated when compiling for gfx908.
7 warnings generated when compiling for gfx1101.
7 warnings generated when compiling for gfx906.
7 warnings generated when compiling for gfx900.
7 warnings generated when compiling for gfx940.
7 warnings generated when compiling for gfx90a.
7 warnings generated when compiling for gfx1030.
7 warnings generated when compiling for gfx1102.
7 warnings generated when compiling for gfx90a.
7 warnings generated when compiling for gfx941.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, 
outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads),In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: 
note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of
member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ unWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, 
uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pe, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hTE:P562S:/15s:i zwarning: einitializer order does not match the declaration order [-Wreorder-ctor]o f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a916d:s7(:n tnote: hin instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads), t916i | d I n B l o cpkr(itmhsr(egardoIudpxT.ixd),, 
ggrroouuppN(tghrroeuapd)s,, &| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e c v| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) &send, a563r | g s - > ssetnedpbSuifzfe,( nacrcglsS-h>mreemc.vcboumfmf.,b u f| f ^S izes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 3, 2>::run' requested hereS IMPLE ]202/ | N C C L _ S T E PRSu/nsWiozrekoEfl(eTm)e)n t{< F n| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ T ,| group(groupR edOp, Algo, Proto>().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:)916;: 7 :| ^note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp916: | 9 : 1 : note: prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::916562::715:: note: warning: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 916 | prim s562( | g r o u ptTiidd(,t igdr)o,u pnNtthhrreeaaddss(,n t&hrreecavd,s )&,s etnidd,I naBrlgosc-k>(stehnrdebaudfIfd,x .axr)g,s -g>rroeucpv(bgurfofu,p ) ,| ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 563note: | in instantiation of member 
function 'RunWorkElement, 3, 2>::run' requested here st e202p | S i z e ( n c c lRSuhnmWeomr.kcEolmemm.ebnutfN(C)C.Lr_uSnT(EwPeS)/;s i z| e ^o f(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~11 : 1| : group(group note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hF:U916N:C7(:A lnote: lin instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested hereR educe, C916O | L L N E T _ CpHrAiImNs,( gSrIoMuPpLTEi,d ,S ugmr,o ufplNotahtr)e a d| s^, &recv,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :&391s:e95n:d ,note: expanded from macro 'IMPL_COLL_FUNC'a rgs->send b391u | f f ,R uanrWgosr-k>, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)c##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | 
prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: 
in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 
17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, 
Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group , group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ lgo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: 
warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: 
variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 
7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- 
--offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o 
-MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: 
warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: 
unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | 
^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: 
warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: 
warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not 
used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1,
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 
| runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 
508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: 
field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: 
note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: 
variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: 
warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 
0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation 
of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' 
requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' 
requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | 
prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:C note: Oexpanded from macro 'IMPL_COLL_FUNC'L LNET_CHAI N391, | S IRMuPnLWEo,r kM, NCCL_AL G391O | _ # #RaulngWoo,r kNy(p)e.,r uFnu(n&cn#c#cdleSvhrmeedmo.pw ,\ N C| C ^L _ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:#562a:l15g:o ,note: field 'nthreads' will be initialized after field 'tidInBlock'N CCL_PRO T562O | _ # # p rtoitdo(>t(i)d.)r,u nn(t&hnrcecaldSsh(mnetmh.rweoardks));, \t i d| I ^n Block(threadIdx.x), group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p15):, note: field 'nthreads' will be initialized after field 'tidInBlock'| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60562: | note: field 'group' will be initialized after field 'stepSize' tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~( 
grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 60 :| ^~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, 
half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: 
warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: 
warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' 
requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested 
here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), 
group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##a7 warnings generated when compiling for gfx906. lgo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 
-mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), 
group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not 
used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 
'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset 
= tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: 
warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 
94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 
| flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 
7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), 
group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: 
unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | 
prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: 
variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 
0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested 
here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 
0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ lag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in 
instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in 
instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: 
warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes)[,N CgCrLo_uPpR(OgTrOo_uSpI)M,P L E| ] ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _STEPS/sizeof(T) )563 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s t e| p group(groupS ize(ncclShmem.comm.buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hs:[34N:C7C:L _note: Pin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereR OTO_SIMPLE ]34/ | N C C L _ S TpErPiSm/ss(itziedo,f (nTt)h)r e{a d s| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ & r| i group(groupn g->prev, &ring->next, args->sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:b34u:f7f:, note: ain instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer gs->recvbu f34f | , a r g s -p>rriemdsO(ptAirdg,, n0t,h raeragdss-,> c&orninnIgn-d>epxr,e va,r g&sr-i>ncgo-n>nnIenxdte,x )a;r g s| - ^>s endbuff, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hg:s80-:>5r:e cnote: vin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereb uff ,80 | a r g s -r>urneRdiOnpgAoctoon>n(Ianrdgesx),; a r| g ^s ->connIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:e202x:)53;: note: | in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h : 80 : 5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested hereR unWo r80k | E l e m ernutnP(raortgos>)(;) . 
r| u ^n (we); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cppin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here: 4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here202 | 4 | I M PRLu_nCWOoLrLk_EFlUeNmCe(nRten(t)8._rtu)n ( w| e^) ; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here391 | Ru n10W | oIrMkPf,) NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562g:s15-:> cwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]n nIndex, args->connInde x562 | ) ; | t ^i d(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h):,80 :n5t:h rnote: ein instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herea ds( n80t | h r e a drsu)n,R itnigdx(.axr)g,s )g;r o u| p ^( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 53 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 563202 | | s t e p 
SRiuzneW(onrckcEllSehmmeenmt.S(I)M.PrLuEn](/wNeC)C;L _ S| T ^E PS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cppT:)13): 1{: note: | in instantiation of member function 'RunWork, 1, 2>::run' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 13 group(group | IMPL_COLL_FUNC(Reduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :R34I:N7G:, note: Sin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI MPLE, Sum, 34r | c c l _ b f lporaitm1s6()t i d| ,^ nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391,: 95&:r inote: nexpanded from macro 'IMPL_COLL_FUNC'g ->prev, &391r | i n gR-u>nnWeoxrtk,< nacrcglsF-u>nsce#n#dfbuunfcf,, tayrpges,- >Fruenccv#b#udfefv,r eadrogps<-t>yrpeed>O,p ANrCgC,L _0A,L GaOr_g#s#-a>lcgoon,n INnCdCeLx_,P RaOrTgOs_-#>#cpornontIon>d(e)x.)r;u n (| & ^n cclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.he:m80.:w5o:r knote: )in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here; \ | 80 ^ | r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562R:i15n:g i(da(rtgisd));, n| t ^h reads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s )note: ,in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here tidIn B202l | o c k ( t h r e aRduIndWxo.rxk)E,l egmreonutp<(Fgnr,o uTp,) ,R e d| O ^~~~~~~~~~~~~~~~~p , A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:g562o:,60 :P rnote: ofield 'group' will be initialized after field 'stepSize't o>().r u562n | ( w e ) ;t i d| ( ^t id), 
nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cppe:a7d:s1(:n tnote: hin instantiation of member function 'RunWork, 1, 2>::run' requested herer eIn file included from a /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cppd7:s | 1)I: ,MIn file included from P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.htL:i_10dC: IOIn file included from nL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hBL:l_167oF: cU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hkN:(C562t(:hR15re:ed auwarning: dcinitializer order does not match the declaration order [-Wreorder-ctor]Ie d,x .RxI)N ,G562 , | g rS oI uM pPt(LigEdr,(o tuSipud)m),,, un| it ^~~~~~~~~~~nh tr3e2a_dts)( n t| h^r eads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:I95n:B lnote: oexpanded from macro 'IMPL_COLL_FUNC'c k(threadI d391x | . 
x )R,u ngWroorukp<(ngcrcoluFpu)n,c # #| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n c| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) type, F u563n | c # # d esvtreepdSoipzl,S hNmCeCmL._cAoLmGmO._b#u#faflSgioz,e sN[CNCCLC_LP_RPORTOOT_O#_#SpIrMoPtLoE>](/)N.CrCuLn_(S&TnEcPcSl/Sshimzeemo.fw(oTr)k)) ;{ \ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: 562note: | in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tid(tid )34, | n t h r e apdrsi(mnst(htrieda,d sn)t,h rteiaddIsn,B l&orcikn(gt-h>rperaedvI,d x&.rxi)n,g -g>rnoeuxpt(,g raorugps)-,> s e| n ^~~~~~~~~~~~~~~~~d buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 :a60r:g snote: -field 'group' will be initialized after field 'stepSize'> recvbu f562f | , a r gtsi-d>(rteiddO)p,A rngt,h r0e,a dasr(gnst-h>rceoandnsI)n,d etxi,d IanrBglso-c>kc(otnhnrIenaddeIxd)x;. 
x )| , ^ group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h,: 80 :| 5 ^~~~~~~~~~~: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(args); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :t53i:d (note: tin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here id), n202th | r e a d s ( n t hRruenaWdosr)k,E lteimdeInntBg(r)o.urpu)n, ( w| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) ; | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp : 7 : 1s:t enote: pin instantiation of member function 'RunWork, 1, 2>::run' requested hereS ize( n7c | cIlMSPhLm_eCmO.LcLo_mFmU.NbCu(fRfeSdiuzcees,[ NRCICNLG_,P RSOITMOP_LSEI,M PSLuEm],/ NuCiCnLt_3S2_TtE)P S /| s^i zeof(T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):)391 :{95 : | note: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~expanded from macro 'IMPL_COLL_FUNC' | group(group 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h562:15: warning: :initializer order does not match the declaration order [-Wreorder-ctor] 34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 56234 | | t i dp(rtiimds)(,t indt,h rnetahdrse(andtsh,r e&ardisn)g,- >prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->con/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 
1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1100. 
17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 
0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 
1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation 
of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | r562u | n R i n gts((anrtghsr)e;a d s| ) ^, tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c202k:(53t:h rnote: ein instantiation of member function 'RunWorkElement, 1, 2>::run' requested herea dIdx. 
x202) | , g r o u p ( gRruonuWpo)r,k E l| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m e n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)< Fn, T, 563R | e d O p ,s tAelpgSoi,z eP(rnoctcol>S(h)m.ermu.nc(owmem).;b u f| f ^S izes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cppC:L4_:P1R:O Tnote: Oin instantiation of member function 'RunWork, 1, 2>::run' requested here_ SIMP L4E | ]I/MNPCLC_LC_OSLTLE_PFSU/NsCi(zReeodfu(cTe),) R{I N G| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ S I| M group(groupP LE, Prod, int8_t) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :34:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 34 | 391 | R u npWroirmks<(ntcicdl,F unntch##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in 
instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(t 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 
1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: 
note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, 
rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreaop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 
1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 
17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, 
RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, 
RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing<T, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hBl:o562c:k15(:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e adIdx.x), group(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d ), nthr e563a | d s ( n tshtreepaSdisz)e,( ntcicdlISnhBmleomc.kc(otmhmr.ebaudfIfdSxi.zxe)s,[ NgCrCoLu_pP(RgOrToOu_pS)I,M P L| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~] / N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_STEP S563/ | s i z e osft(eTp)S)i z{e ( n| c 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c l S| h group(groupm em.comm.buffSizes[NCCL_PROTO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h]:/34N:C7C:L _note: Sin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT EPS/sizeo f34( | T ) ) { p| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i m s| ( group(groupt id, nthreads, &r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hi:n34g:-7>:p rnote: ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herev , &ring-> n34e | x t , a r gpsr-i>msse(ntdibdu,f fn,t harregasd-s>,r e&crvibnugf-f>,p raervg,s -&>rriendgO-p>Anregx,t ,0 ,a ragrsg-s>-s>ecnodnbnuIfnfd,e xa,r gasr-g>sr-e>ccvobnunfIfn,d eaxr)g;s - >| r ^e dOpArg,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :080,: 5a:r gnote: sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here- >con n80I | n d e x ,r uanrRgisn-g>(arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hs:)80;: 5 :| ^note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202r:u53n:R inote: nin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereg R(uanrWgosr)k;E l e| m ^e nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 
1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' 
requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, 
RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 
--offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 
1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in 
instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, 
RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -c 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 
'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, 
Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation 
of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 |
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, 
PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: 
in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but 
not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: 
field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(aIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here rgs); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but 
not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: 
field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings 
generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 
510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), 
group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, 
nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 7 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 
'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings 
generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, 
nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 212 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadcIdx.x), ogmrmo.ubpu(fgfrSoiuzpe)s,[ N C| C ^~~~~~~~~~~~~~~~~L _PROTO_SIMPLE]/NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:si562z:e60o:f (note: Tfield 'group' will be initialized after field 'stepSize') ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.ho:c149k:(62t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested herea dIdx.x), g149r | o u p ( g r oPurpi)m,i t i| v ^~~~~~~~~~~e s, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template 
specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 212 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template 
specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 
9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 
1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when 
compiling for host. 9 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), 
| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ 
| = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ unc##devredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after 
field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ E]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, fa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hl:s134e:)14;: note: | initialize the variable 'dst' to silence this warning ^ 134/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | 
: 405 : 3 :v onote: iexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd *dst, *src ;405 | | ^m s c| c = nullptrl RunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:t134e:p14S:i znote: einitialize the variable 'dst' to silence this warning( ncclS h134m | e m . 
c ovmomi.db u*fdfsSti,z e*ss[rNcC;C L _| P ^R O T| O = nullptr_ SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, workIn file included from )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp;: 1\: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : ^13 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: warning: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will 
be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: 
initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused 
variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 
0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 |
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 
2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation 
of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, 
Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: 
note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1'
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(: aIn file included from r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hg:s167): ; /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^:15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hered (tid), nthr e202a | d s ( n t h r e aRdusn)W,o rtkiEdlIenmBelnotcp()),. 
r u| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( w e| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T); | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpps:t4e:p1S:i znote: ein instantiation of member function 'RunWork, 1, 2>::run' requested here( nccl S4h | mIeMmP.Lc_oCmOmL.Lb_uFfUfNSC(iRzeedsu[cNeCSCcLa_tPtReOrT,O _RSIINMG,P LSEI]M/PNLCEC,L _PSrToEdP,S /isnitz8e_otf)( T )| )^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 group(group: 95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested heren c, type, F u33 | n c # # d e vprreidmosp(n,t hNrCeCaLd_sA,L G&Or_i#n#ga-l>gpor,e vN,C C&Lr_iPRnOgT-O>_n#e#xptr,o tao>r(g)s.-r>usne(n&dnbcucflfS,h maermg.sw->orrekc)v;b u\f f ,| ^a rgs->redOp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:r562g:,15 :0 ,note: field 'nthreads' will be initialized after field 'tidInBlock'a rgs->conn I562n | d e x , tairdg(st-i>dc)o,n nnItnhdreexa)d;s ( n| t ^h reads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hI:n78B:l5o:c knote: (in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heret hrea d78I | d x . 
x )r,u ngRrionugp<(Tg,r oRuepd)O,p , | Proto>(args) ^~~~~~~~~~~~~~~~~; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 562in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | t i202d | ( t i d ) , n tRhurneWaodrsk(Enltehmreenatd)(,) .grruonu(pw(eg)r;o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 
10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' 
requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:15: warning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:u15: warning: initializer order does not match the declaration order [-Wreorder-ctor] p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | t563i | d ( t i ds)t,e pnStihzree(andcsc(lnSthhmreema.dcso)m,m .tbiudfIfnSBilzoecsk[(NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo]u/pN(CgCrLo_uSpT)E,P S /| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i z e| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)f (T)) { 563| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groups tepSize(ncclShmem.comm.buffSizes[NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hOT:O33_:S7I:M Pnote: Lin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereE ]/NCCL_STEPS /33s | i z e o f ( Tp)r)i m{s ( t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d , | n group(groupt hreads, &ring->prev, &ring->next, 
args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hs:e33n:d7b:u fnote: fin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here, args->recv b33u | f f , a r gpsr-i>mrse(dtOipdA,r gn,t h0r,e aadrsg,s -&>rcionngn-I>npdreexv,, a&rrgisn-g>-c>onnenxItn,d eaxr)g;s - >| s ^e ndbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :a78r:g5s:- >note: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heree cvbu f78f | , a r grsu-n>RriendgOconnIndex, args->connIndexO)p;, P| r ^o to>(arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hs:)78;: 5 :| ^note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:u202n:R53i:n gnote: , 1, 2>::run' requested hereT , RedOp ,202 | P r o t o > ( a rRgusn)W;o r k| E ^l ement, 1, 2>::run' requested hereA lgo, P r202o | t o > ( ) . 
r u nR(uwneW)o;r k E| l ^e ment, 1, 2>::run' requested hereA lg o12, | IPMrPoLt_oC>O(L)L._rFuUnN(Cw(eR)e;d u c| e ^S catter, RING/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp,: 6S:I1M:P Lnote: Ein instantiation of member function 'RunWork, 1, 2>::run' requested here, Prod ,6 | dIoMuPbLl_eC)O L L| _^F UNC(Redu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:e391S:c95a:t tnote: eexpanded from macro 'IMPL_COLL_FUNC'r, RI NG, SIMPL E391, | P rRoudn,W oirnkt<3n2c_ctl)F u n| c^# #func, t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hy:p391e:,95 :F unote: nexpanded from macro 'IMPL_COLL_FUNC'c ##devredo p391< | t y pReu>n,W oNrCkCd(o)p.n,c cNlCSChLm_eAmL.GwOo_r#k#)a;l g\o , | N ^C CL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562o:t15o:> (note: )field 'nthreads' will be initialized after field 'tidInBlock'. run(&nccl S562h | m e m . wtoirdk()t;i d\) , | n ^t hreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15):, note: tfield 'nthreads' will be initialized after field 'tidInBlock'i dInBloc k562( | t h r e atdiIdd(xt.ixd)),, gnrtohurpe(agdrso(unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I60n:B lnote: ofield 'group' will be initialized after field 'stepSize'c k(threa d562I | d x . 
x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~~~~~~~n thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60):, note: tfield 'group' will be initialized after field 'stepSize'i dInBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~) , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx900. 
17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' 
requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused 
variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 
78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nth,reads(nthre aSdIsM)P,L Et,i dSIunmB,l oicnkt(3t2h_rte)a d I| d^x .x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^~~~~~~~~~~: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ rgs->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' 
requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 
case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 
3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 
| int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' 
set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested 
here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmrgs->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested 
here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' 
requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 
10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 
0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset 
= tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' 
requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested 
here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 
| runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 
'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ P| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O TO_##pr o563t | o > ( ) .srtuenp(S&inzcec(lnSchcmleSmh.mweomr.kc)o;m m\. b u| f ^f Sizes[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O15T:O _note: Sfield 'nthreads' will be initialized after field 'tidInBlock'I MPLE]/N C562C | L _ S T EtPiSd/(stiizde)o,f (nTt)h)r e{a d s| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t h| r group(groupe ads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h(:t33h:r7e:a dnote: Iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hered x.x), grou p33( | g roup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing<T, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing<T, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>,
0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 
17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 
0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested 
here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 
0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, 
SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 
0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 
0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing<T, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing<T, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:G562O:_15#:# awarning: linitializer order does not match the declaration order [-Wreorder-ctor]g o, NCCL_PROTO_## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k15(:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e adIdx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s (nthrea d563s | ) , t isdtIenpBSliozcek((ntchcrleSahdmIedmx..cxo)m,m .gbruofufpS(igzreosu[pN)C,C L _| P ^~~~~~~~~~~~~~~~~R OTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:S562I:M60P:L Enote: ]field 'group' will be initialized after field 'stepSize'/ NCCL_ST E562P | S / s i zteiodf((tTi)d)) ,{ n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups (nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hd:InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 
| mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: :warning: 154variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]: 10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 154 | c a scea s3e: 3 :| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp5::59::9 :note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herenote: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 5 | M S CMCSLC_CILM_PILM_PKLE_RKNEERLN_EELN_TERNYT_RFYU_NFCU_NDCE_VDREEVDROEPD_OTPY_PTEY(PSEu(mS,u mu,i nuti3n2t_3t2,_ tf,a lfsael)s;e ) ;| ^ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::3405:: 3note: :expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | ms c405c | l R umnsIcnctleRrupnrIentteerrp,< tPyrpoet>o,L LP1r2o8t,o SfiumlpllOepS(CcCoLm_mC,H UaNlKgSoT,E PwSo/rMkS)C;C L\_ S L| I ^C ESTEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 165M:S33C:C Lnote: _uninitialized use occurs hereS LICESTE P165S | > , f uclolpOypTso>S(hcmoemmm8,( taildg%oW,A RwPo_rSkI)Z;E ,\ d s| t ^, src, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hb:y165t:e33s:) ;note: uninitialized use occurs here | ^~~ 165 | copyT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:S162h:m5e:m 8warning: (variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]t id% W162A | R P _ S IdZeEf,a udlstt:, s| r ^~~~~~~c , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hb:y165t:e33s:) ;note: uninitialized use occurs here | ^~~ 165 | copyTo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hS:h162m:e5m:8 (warning: tvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]i d%W A162R | P _ S I ZdEe,f adusltt,: s r| c ^~~~~~~, b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hy:t165e:s33):; note: uninitialized use occurs here| ^~~ 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134 :13414 | : note: initialize the variable 'dst' to silence this warning void *134d | s t , *vsoricd; * d| s ^t , | * = nullptrs rc; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:In file included from 169/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp: :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h1:: 509In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h29::13 : warning: In file included from field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 507 | tid(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr),e wid(tid%WARP_SIZE), awdasr(pn(tthirde/aWdAsR)P,_ StIiZdEI)n,B l o| c ~~~~~~~~~~~~~~~~~~k ( t| h stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)r eadId x508. | x ) , gwraoruppI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d I| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x .x/WARP _563S | I Z E ) ,s t e| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S i z| e warp(tid/WARP_SIZE( ncclS h509m | e m . 
c ofmlma.gbTuhfrfeSaidz(e(st[iNdC%C4L)_=P=R3O)T,O _gSrIoMuPpL(Eg]r/oNuCpC)L,_ S T| E ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~P S /| s warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3i zeof(T) )510 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s t e| p group(groupS ize(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:.217c:o57m:m .note: bin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereu ffSize s217[ | N C CPLr_iPmRiOtTiOv_eLsL<1T2,8 ]R/eNdCOCpL,_ SFTaEnPASs/ysmimzeetorfi(cu4,_ t1),) P{r o t| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, 0| > group(group prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:: 5note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here9 : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 217 | P r5i | m i t i v e s < TM,S CRCeLd_OIpM,P LF_aKnEARsNyEmLm_eEtNrTiRcY<_1F,U1N>C,_ D1E,V RPErDoOtPo_,T Y0P>E (pSruimm,s u i| n ^t 32_t,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp :f5a:l9s:e )note: ;in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here | ^ 5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 405 : 3 : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' MSCCL_IMPL_ K405E | R N EmLs_cEcNlTRRuYn_IFnUtNeCr_pDrEeVtReErDs,e )P;r o t| o ^S impl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:<402M:S3C:C 
Lnote: _expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'C HUNKSTE P402S | / M SmCsCcLc_lSRLuInCIEnStTeErPpSr,e tMeSrCe,v rfeudlolpOp(ec>o,m mP,r oatlogLoL,1 2w8o,r kf)u;l l\O p s| > ^( comm,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :a562l:g15o:, note: wfield 'nthreads' will be initialized after field 'tidInBlock'o rk); \ 562| | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple ,154 | f u l l Ocpass>e( c3o:m m ,| ^a lgo, work); \ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp :5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'5 | 562 | M S C C Lt_iIdM(PtLi_KdE)R,N EnLt_hErNeTaRdYs_(FnUtNhCr_eDEaVdRsE)D,O Pt_iTdYIPnEB(lSoucmk,( tinhtr6e4a_dtI,d xf.axl)s,e )g;r o u| p ^( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :| 402 ^~~~~~~~~~~~~~~~~: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'562 | tid (402 | t i dm)s,c cnltRhurneIandtse(rnptrherteeard),, PgrrootuopL(Lg1r2o8,u pf)u,l l O| p ^~~~~~~~~~~s >(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hi:m154s: 10 :| ^warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5: 9154: | note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here case 53: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 
15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1,In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, 
ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: 
warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: 
in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: 
variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: 
warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, 
ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 
'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, 
ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -MF 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: 
note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 |
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 
15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: 
expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; 
| ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: 
expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ IZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: 
in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 
| Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: 
warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL,
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from 
macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src,
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 15 warnings generated 
when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ LL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cppIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ :1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ InBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 15 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not 
used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 
| mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; 
| ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | msccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hR:u134n:I14n:t enote: rinitialize the variable 'dst' to silence this warningp reterr,c ;P r o| t ^o S i| m = nullptrp le, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 134| : ~~~~~~~~~~~~~~~~~~14 : | note: stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)initialize the variable 'dst' to silence this warning 508134 | | 
wvaoripdI n*Bdlsotc,k (*tshrrce; a d| I ^d x .| x = nullptr/ WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cppe:r1p: rIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:e13r: , P r562o | t o L L 1t2i8d,( tfiudl)l,O pnst>h(rceoamdms,( natlhgroe,a dwso)r,k )t;i d\I n B| l ^ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),In file included from tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cppo:c1k: (tIn file included from h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:e13a: dIn file included from I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hd:x168.: 
x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h,: 153g:r14o:u pwarning: (unused variable 'data1' [-Wunused-variable]g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563153 | | sutienptS3i2z_et( ndcactlaS1h,m efml.acgo1m,m .dbautfaf2S,i zfelsa[gN2C;C L _| P ^~~~~R OTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hz:e153o:f21(:T )warning: )unused variable 'flag1' [-Wunused-variable] { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 | | group(group uint32_t data1, flag1, data2, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:l217a:g572:; note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h: 153217: | 28 : Pwarning: runused variable 'data2' [-Wunused-variable]i mitiv e153s | < T , RueidnOtp3,2 _Fta ndAastyam1m,e tfrliacg<11,, 1d>a,t a12,, Pfrloatgo2,; 0 >| ^~~~~p rim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hs: 153 :| 35 ^: warning: unused variable 'flag2' [-Wunused-variable] 153 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp : 5 : 9u:i nnote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here3 2_t da t5a | 1 , f l a g 1 ,M SdCaCtLa_2I,M PfLl_aKgE2R;N E L| _ ^~~~~E NTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 
| mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset 
= tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 
'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL,
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note:
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 
134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 
| mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | cIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpps:e1 : 3In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 13| : ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 562 | 5 | t i d ( t i d )M,S CnCtLh_rIeMaPdLs_(KnEtRhNrEeLa_dEsN)T,R Yt_iFdUINnCB_lDoEcVkR(EtDhOrPe_aTdYIPdEx(.Pxr)o,d ,g rionutp6(4g_rto,u pf)a,l s e| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~; | | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^ 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 402 : 3 :s tnote: eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'p Size(nc c402l | S h mmesmc.ccloRmumn.IbnutfefrSpirzeetse[rNP,S /PsriozteooLfL(1T2)8), {f u l| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O p s| > group(group( comm, algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hw:o217r:k57):; note: \in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here | ^ 217 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165P:r33i:m inote: tuninitialized use occurs herei vesR,P _1S,I ZPEr,o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' 
is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]o , 0> prims | 154 ^ | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cppnote: :in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here5 :9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 5 | M S C C LM_SICMCPLL__IKMEPRLN_EKLE_RENNETLR_YE_NFTURNYC__FDUENVCR_EDDEOVPR_ETDYOPPE_(TPYrPoEd(,P riondt,6 4i_ntt,6 4f_atl,s ef)a;l s e| ) ^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h3::405 :note: 3expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE': note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 405m | s c cmlsRcucnlIRnutneIrnptreertperrey,p eP>r,o tPorSoitmopSliemE,P Sf>u,l lfOuplsl>O(pcso>m(mc,o maml,g oa,l gwoo,r kw)o;r k\) ; | \ ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h165::56233::15 :note: uninitialized use occurs herenote: field 'nthreads' will be initialized after field 'tidInBlock' 165 | 562 | c o p y TtoiSdh(mteimd8)(,t indt%hWrAeRaPd_sS(InZtEh,r edasdts,) ,s rtci,d IbnyBtleosc)k;( t h| r ^~~e adIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hg:r162o:u5p:) ,warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] | ^~~~~~~~~~~~~~~~~ 162 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :d60e:f anote: ufield 'group' will be initialized after field 'stepSize'l t: | ^~~~~~~ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 165 : 33 :t inote: duninitialized use occurs here( tid), 165n | t h r e acdosp(ynTtohSrhemaedms8)(,t itdi%dWIAnRBPl_oScIkZ(Et,h rdesatd,I dsxr.cx,) ,b ygtreosu)p;( g r| o ^~~u p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 
15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ id; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: 
in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 154warning: :initializer order does not match the declaration order [-Wreorder-ctor]10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 562 | t154i | d ( t i dc)a,s en t3h:r e a| d ^s (nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cppa:d5I:d9x:. 
xnote: )in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here, group (5g | r o u p ) , | M ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S C C| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ IMPL_KE R563N | E L _ E NsTtReYp_SFiUzNeC(_nDcEcVlRSEhDmOePm_.TcYoPmEm(.Pbruofdf,S ihzaelsf[,N CfCaLl_sPROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | e group(group ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h57:: 402note: :in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here3 : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 217 | Pri m402i | t i vmessc#,d e1v,r ePdroopt> ,p rPirmost o L| L ^1 28, f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cppu:l5l:O9p:s >note: (in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested herec omm, a5l | g o , w o r k )M;S C\C L _| I ^M PL_KE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hR:N165E:L33_:E Nnote: Tuninitialized use occurs hereR Y_FUNC_ D165E | V R E D OcPo_pTyYTPoES(hPmreomd8,( thiadl%fW,A RfPa_lSsIeZ)E;, d| s ^t , sr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:,405 :b3y:t enote: sexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE') ; | ^~~ 405 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:s162c:c5l:R uwarning: nvariable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]I nte r162p | r e t e rd, 165P | r o t o SciomppylTeo, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, 
FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, ful/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :154:10: 134warning: | variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] voi d154 | * d s t ,c a*sser c3;: | | ^ ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::13162: :In file included from 5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:: 169warning: : variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h :509: 29162: | warning: 
field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] default: | ^~~~~~~ 507/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 165 : 33 :t inote: duninitialized use occurs here( tid), 165n | t h r e acdosp(ynTtohSrhemaedms8)(,t iwdi%dW(AtRiPd_%SWIAZREP,_ SdIsZtE,) ,s rwca,r pb(yttieds/)W;A R P| _ ^~~S IZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::134217::1457:: note: note: initialize the variable 'dst' to silence this warningin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 134217 | | P rviomiidt i*vdesst<,T ,* sRrecd;O p ,| ^F a n| A = nullptrs ymmetric<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr alf, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(n /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, 
FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 
15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | 
default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 15 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 
| int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from 
macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hk:)154;: 10\: warning: | variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: 165154: | 33 : note: uninitialized use occurs herec ase 3: 165| | ^ copyToSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cppm:e5m:89(:t inote: din instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here% WARP_SI Z5E | , d s t , s rMcS,C CbLy_tIeMsP)L;_ K E| R ^~~N EL_ENTR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hY:_162F:U5N:C _warning: Dvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]E VRE D162O | P _ T Y PdEe(fParuoldt,: f l| o ^~~~~~~a t,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :f165a:l33s:e )note: ;uninitialized use occurs here | ^ 165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 405 : 3 :c onote: pexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'y ToShmem8(t i405d | % W AmRsPc_cSlIRZuEn,I ndtsetr,p rsertce,r , ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 134 : 14c:o pnote: yinitialize the 
variable 'dst' to silence this warningT oShmem 8134( | t i d % WvAoRiPd_ S*IdZsEt,, d*sstr,c ;s r c| , ^ b y| t = nullptre s); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: 
warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, 
ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: 
warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:c154c:l10R:u nwarning: Ivariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]n terpret e154r | < t y p ec,a sFeu n3c:# # d| e ^v redop, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cppP:r5o:t9o:S inote: min instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herep le_,T YfPuEl(lPOrposd>,( cdoomumb,l ea,l gfoa,l sweo)r;k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::3165:: 33note: :expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' note: uninitialized use occurs here 165 | 405 | mcsocpcylTRouSnhImnetme8r(ptriedt%eWrA , | P ^~~r otoSimp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hl:e162<:M5S:C Cwarning: Lvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]_ CHU N162K | S T E P Sd/eMfSaCuClLt_:S L I| C ^~~~~~~E ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hE:P165S:,33 :M Snote: Cuninitialized use occurs hereC L_SL I165C | E S T E PcSo>p,y TfouSlhlmOepms8>((tciodm%mW,A RaPl_gSoI,Z Ew,o rdks)t;, \s r c| , ^ byte/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:)165;: 33 :| ^~~note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlockIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cppt:h1r: eIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd:I13d: xIn file included from ./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hx:)167,: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:( gwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d ( tsitde)p,S 
inzteh(rnecacdlsS(hnmtehmr.ecaodmsm).,b utfifdSIinzBelso[cNkC(CtLh_rPeRaOdTIOd_xS.IxM)P,L Eg]r/oNuCpC(Lg_rSoTuEpP)S,/ s i| z ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e o f| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~563 | | group(group stepSize(nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:l217S:h57m:e mnote: .in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested herec omm.bu f217f | S i zPersi[mNiCtCiLv_ePsRf,( T1),) P{r o t| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, 0| > group(group prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:: 5note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here9 : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 217 | P r5i | m i t i v e s < TM,S CRCeLd_OIpM,P LF_aKnEARsNyEmLm_eEtNrTiRcY<_1F,U1N>C,_ D1E,V RPErDoOtPo_,T Y0P>E (pPrriomds, d| o ^u ble, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cppf:a5l:s9e:) ;note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here | ^ 5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 405 : 3 : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' MSCCL_IMPL _405K | E R NmEsLc_cElNRTuRnYI_nFtUeNrCp_rDeEtVeRrEs,e )P;r o t| o ^S 
impl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:<405M:S3C:C Lnote: _expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'C HUNKSTEPS/ M405S | C C Lm_sScLcIlCREuSnTIEnPtSe,r pMrSeCtCeLr_#,# dfeuvlrleOdposp><(tcyopmem>,, aPlrgoot,o Swiomrpkl)e;< M\S C C| L ^_ CHUNKS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:E562P:S15/:M Snote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_SLICE S562T | E P S , tMiSdC(CtLi_dS)L,I CnEtShTrEePaSd>s,( nftuhlrleOapdss>)(,c otmimd,I naBllgooc,k (wtohrrke)a;d I\d x .| x ^) , gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock') , | ^~~~~~~~~~~~~~~~~ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 60 :t inote: dfield 'group' will be initialized after field 'stepSize'( tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. 
x )| , ^~~~~~~~~~~~~~~~~ gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g60r:o unote: pfield 'group' will be initialized after field 'stepSize') , | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence 
this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of 
member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 
case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WAR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr P_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: 
variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h4)==3):,154 :g10r:o uwarning: pvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]( group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~154 | | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 case 3 :510 | | ^ stepSize(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cppc:o5m:m9.:b unote: fin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested heref Sizes[NC C5L | _ P R O T O _ L LM1S2C8C]L/_NICMCPLL__SKTEERPNSE/Ls_iEzNeToRfY(_uFiUnNtC6_4D_EtV)R)E {D | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O P _| T group(groupY PE(Max, uint8_t, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:a217l:s57e:) ;note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here | ^ 217 | Pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:i402t:i3v:e snote: r,p r1e,t ePrrF upnrci#m#sd e v| r ^e dop5,: 9P:r onote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereo LL128, 5f | u l l O p s > ( cMoSmCmC,L _aIlMgPoL,_ KwEoRrNkE)L;_ E\N T R| Y ^_ FUNC_DE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hV:R165E:D33O:P _note: Tuninitialized use occurs hereY PE(Max, 165u | i n t 8 _cto,p yfTaolSshem)e;m 8 (| t ^i 
d%WARP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:S402I:Z3E:, note: dexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE's t, src, 402b | y t emss)c;c l R| u ^~~n Interpretera,u lPtr:o t o| L ^~~~~~~L 128/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 165f:u33l:l Onote: puninitialized use occurs heres >(com m165, | a l g oc,o pwyoTrokS)h;m e\m 8 (| t ^i d%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: 
in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation 
of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: 
expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 
271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter<type, Func##devredop<type>, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, 
ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized 
after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, 
rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ 
| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' 
is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of 
member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr , ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, Prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here oLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 
| mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cppvariable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]: 1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp : 5 : 9 : note: Min instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested hereS CCL_IMP L5_ | K E R N E L _ E NMTSRCYC_LF_UINMCP_LD_EKVERRENDEOLP__ETNYTPREY(_MFaUxN,C _uDiEnVtR6E4D_OtP,_ TfYaPlEs(eM)a;x , | u ^i nt64_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 399f:a3l:s enote: )expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'; | ^ 399 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:s399c:c3l:R unote: nexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'I nterpre t399e | r < tmyspcec,l RFuunnIcn#t#edrepvrreetdeorp< ,F uPnrco#t#odLeLv,r efduolple(>c,o mPmr,o taolLgLo,, fwuolrlkO)p;s >\( c o| m ^m , algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :w165o:r33k:) ;note: uninitialized use occurs here\ | ^ 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :c165o:p33y:T onote: Suninitialized use occurs hereh mem8(t i165d | % W A R Pc_oSpIyZTEo,S hdmsetm,8 (stricd,% WbAyRtPe_sS)I;Z E ,| ^~~d st, src, bytes)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h;: 162 :| 5 ^~~: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | def/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:u162l:t5:: warning: | variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :162165 | : 33 : note: duninitialized use occurs heree fault :165 | | ^~~~~~~ co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:y165T:o33S:h mnote: euninitialized use occurs herem 8(tid %165W | A R P _ ScIoZpEy,T odSshtm,e ms8r(ct,i db%yWtAeRsP)_;S I Z| E ^~~, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : ^134 : 14| : = nullptr note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 
| void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hi:d154(:In file included from t10/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cppi::d 1%warning: : Wvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]In file included from A /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hR:P13 _: 154SIn file included from | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h Z: E167 ): ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc :aw562sa:er15 p:3( :twarning: iinitializer order does not match the declaration order [-Wreorder-ctor] d | / ^W ARP_SIZ E562) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp, : 5 :| 9 ~~~~~~~~~~~~~~~~~~ t: i | dnote: stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)(in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here t i d508) | , 5 | n t hw ra er ap dI sn (BMnlStoChcCrkLe(_atIdhMsrP)eL,a_ dKtIEidRdxNI.EnxLB/_lWEoANcRTkPR(_YtS_hIFrZUeENa)Cd,_I Dd Ex| V. 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Rx E) D,| O warp(tid/WARP_SIZEPg _rToYuPp E(509(g | Mr ao xu ,p )fu,li an gt| T6 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h4 r_ et| a, tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d (f(atl is563de | %) 4; ) = =| s3 ^t) e,p Sgi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hrz:oe405u(:pn3(c:gc rlnote: oSexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'uh pm)e,m . c| o405 ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~m | m . | bm warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3us fcfcSliRzu en510sI | [n Nt Ce Cr Lps_rtPeeRtpOeSTriOs(,[T N)PC)rC oL{t_ oP SR| iO ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~mT pO l_| eL group(group, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereEE PPSS/ /s217Mi | Sz Ce CoPLfr_(iSumLiiIntCtiE6vS4eT_sEt, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_D note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hFanAsymmetr:i154c:<110,:1 >,warning: 1variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized], Proto, 0> prims | ^154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:: 5note: :in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here9 : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 55 | | MMSSCCCCLL__IIMMPPLL__KKEERRNNEELL__EENNTTRRYY__FFUUNNCC__DDEEVVRREEDDOOPP__TTYYPPEE((MMaaxx,, uuiinntt6644__tt,, ffaallssee));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::3405:: 3note: :expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 405 | m smcscclcRluRnuInnItnetreprrperteetree,> ,P rPortootLoLS1i2m8p,l efC(HcUoNmKmS,T EaPlSg/oM,S CwCoLr_kS)L;I C\E S T| E ^P S, 
MSCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hS:L165I:C33E:S Tnote: Euninitialized use occurs hereP S>, ful l165O | p s > ( ccoompmy,T oaSlhgmoe,m 8w(otrikd)%;W A\R P _| S ^I ZE, dst, sr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:,562 :b15y:t enote: sfield 'nthreads' will be initialized after field 'tidInBlock') ; | ^~~ 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h):,162 :n5t:h rwarning: evariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]a ds( n162t | h r e a ddse)f,a utlitd:I n B| l ^~~~~~~o ck/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:t165h:r33e:a dnote: Iuninitialized use occurs hered x.x), 165g | r o u p (cgorpoyuTpo)S,h m e| m ^~~~~~~~~~~~~~~~~8 (tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h%:W562A:R60P:_ Snote: Ifield 'group' will be initialized after field 'stepSize'Z E, dst, 562s | r c , btyitde(st)i;d ) ,| ^~~n threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 
| mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr anAsymmetric<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: 
in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence 
this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: 
in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 
15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1:
In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclIn file included from R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cppu:n1I: nIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:r13p: rIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.ht:e168r: , P r153o | t o L L ,u ifnutl3l2O_pts >d(actoam1m,, fallaggo1,, wdoartka)2;, \f l a| g ^2 ; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::153165::2133:: warning: note: unused variable 'flag1' [-Wunused-variable]uninitialized use occurs here 153 | 165 | u i n tc3o2p_ytT odSahtmae1m,8 (ftliadg%1W,A RdPa_tSaI2Z,E ,f ldasgt2, src, bytes;) ; | ^~~~~| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hwarning: :unused variable 'data2' [-Wunused-variable]162 :5: warning: 153variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] | 162u | i nt 3 2 _dte fdaautlat1:, f| l ^~~~~~~a g1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 165d:a33t:a 2note: ,uninitialized use occurs here flag2 ;165 | | ^~~~~ c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ho:p153y:T35o:S hwarning: munused variable 'flag2' [-Wunused-variable]e m8(t i153d | % W A R 
Pu_iSnItZ3E2,_ td sdta,t as1r,c ,f lbaygt1e,s )d;a t a| 2 ^~~, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr =/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, PrIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, 
ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr otoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tidin, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ %WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 
2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 
15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: 
warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* 
ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* 
ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 
134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 
5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, 
ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 
2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 
15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | 
= nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> primIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr s | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from 
macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but 
not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 
'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 
15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL 
-DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src 
-I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' In file included from 402/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp | : 1 : mIn file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:c13l: RIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hn:I167n: te/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:p562r:e15t:e rwarning: t,i dP(rtoitdo)L,L 1n2t8h,r efaudlsl(Onptsh>r(ecaodmsm),, atligdoI,n Bwloorckk)(;t h\r e a| d ^I dx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 165g:r33o:u pnote: (uninitialized use occurs hereg roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 165 | 563 | c o p ysTtoeSphSmiezme8((ntcicdl%SWhAmRePm_.ScIoZmEm,. 
bdusftf, src, bytes); | ^~~ Siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:s162[:N5C:C Lwarning: _variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]P ROTO_S I162M | P L E ] /dNeCfCaLu_lStT:E P S| / ^~~~~~~s iz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:o165f:(33T:) )note: uninitialized use occurs here{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 165 | copyToShmem8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:t217i:d57%:W Anote: Rin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereP _SIZE ,217 | d s tP,r ismrict,i vbeyst, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL1E2P8S,/ MfSuClClLO_pSsL>I(CcEoSmTmE,P Sa,l gMoS,C CwLo_rSkL)I;C E\S T E| P ^S >, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid),id/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 
| case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: 
variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hn:t134e:r14p:r enote: tinitialize the variable 'dst' to silence this warninge r , | P ^r o t| o = nullptrL L, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp507: | 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :t13i: dIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.ht:i169d: )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h,: 509n:t29h:r ewarning: afield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]d s(nthreads), wid(tid% W507A | R P _ S ItZiEd)(,t iwda)r,p (nttihdr/eWaAdRsP(_nStIhZrEe)a,d s )| , ~~~~~~~~~~~~~~~~~~ w i| d 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)( tid%W A508R | P _ S I ZwEa)r,p IwnaBrlpo(ctki(dt/hWrAeRaPd_ISdIxZ.Ex)/,W A R| P ~~~~~~~~~~~~~~~~~~_ S I| Z stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)E ), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 | | warp(tid/WARP_SIZE war p509I | n B l o cfkl(atghTrheraedaIdd(x(.txi/dW%A4R)P=_=S3I)Z,E )g,r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( g r| o warp(tid/WARP_SIZEu p), 509 | | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ f| l warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3a gThread (510( | t i d % 4s)t=e=p3S)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~b u f| f warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3S izes[NC C510L | _ P R O TsOt_eLpLS1i2z8e](/nNcCcClLS_hSmTeEmP.Sc/osmimz.ebouff(fuSiinzte6s4[_NtC)CL)_ P{R O T| O ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ L L| 1 group(group2 8]/NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hi:z217e:o57f:( unote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested heren t64_t) )217 | { P| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i m i| t group(groupi ves, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herer ic<1,1 217> | , 1P,r iPmriottiov,e s0<>T ,p rRiemdsO p ,| ^F anAsymme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cppt:r5i:c9<:1 ,note: 1>in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here, 1, Pr o5t | o , 0 > p r iMmSsC C L| _ ^I MPL_KERN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cppE:L5_:E9N:T Rnote: Yin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' 
requested here_ FUNC_D E5V | R E D O P _ T Y PMES(CMCiLn_,I MiPnLt_6K4E_RtN,E Lf_aElNsTeR)Y;_ F U| N ^C _DEVRE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hD:O402P:_3T:Y Pnote: Eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'( Min, in t4026 | 4 _ tm,s cfcallRsuen)I;n t e| r ^p reterl,R uPnrIonttoeLrLp1r2e8t,e rfF(ucnocm#m#,d eavlrgeod,o pw ,\ P r| o ^t oLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ |
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: 
warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, 
ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, 
dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: 
warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from 
macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, 
FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation 
of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: 
in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, 
dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 
| copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ lRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ MSCCL_CHUNKSTEPS/MSCCL_SLICESTEPS, MSCCL_SLICESTEPS>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ |
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG 
-D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: 
warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 
'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 
| mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ _t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after 
field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 
5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, 
ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr P_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after 
field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 6, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: 
expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/clang++ -fPIC -pipe -frecord-gcc-switches -Wall -g -O2 -parallel-jobs=16 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -Xlinker --dependency-file=CMakeFiles/rccl.dir/link.d -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o 
CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.1.40093 --hip-link --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 /usr/lib/llvm-rocm/lib64/clang/17/lib/linux/libclang_rt.builtins-x86_64.a -lpthread -lrt -ldl Elapsed time (seconds): 821.751 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Built target rccl gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.66076 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/rccl-buildroot + : + /bin/rm -rf -- /usr/src/tmp/rccl-buildroot + PATH=/usr/libexec/rpm-build:/usr/src/bin:/usr/bin:/bin:/usr/local/bin:/usr/games + cd rccl-2.18.6 + DESTDIR=/usr/src/tmp/rccl-buildroot + cmake --install x86_64-alt-linux --verbose -- Install configuration: "" -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1.0 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-allpairs-16n-16tb.xml -- Installing: 
/usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-16tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-1pass.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets-noconfig.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config-version.cmake -- 
Installing: /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl/LICENSE.txt + rm -rf /usr/src/tmp/rccl-buildroot/usr/rccl + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/rccl-buildroot (auto) mode of './usr/lib64/librccl.so.1.0' changed from 0755 (rwxr-xr-x) to 0644 (rw-r--r--) Verifying and fixing files in /usr/src/tmp/rccl-buildroot (binconfig,pkgconfig,libtool,desktop,gnuconfig) Checking contents of files in /usr/src/tmp/rccl-buildroot/ (default) Compressing files in /usr/src/tmp/rccl-buildroot (auto) Adjusting library links in /usr/src/tmp/rccl-buildroot ./usr/lib64: (from :0) librccl.so.1 -> librccl.so.1.0 Verifying ELF objects in /usr/src/tmp/rccl-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) section [ 3] '.dynsym': symbol 338 (__hip_fatbin): symbol in dynamic symbol table with non-default visibility verify-elf: WARNING: ./usr/lib64/librccl.so.1.0: eu-elflint failed Splitting links to aliased files under /{,s}bin in /usr/src/tmp/rccl-buildroot Processing files: librccl1-2.18.6-alt0.1 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.8634 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd rccl-2.18.6 + DOCDIR=/usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + export DOCDIR + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + /bin/mkdir -p /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + cp -prL README.md LICENSE.txt NOTICES.txt CHANGELOG.md /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R go-w /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R a+rX /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.leTI74 find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) lib.prov: 
/usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1: 191 symbols, 18 bpp Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.8ubh8t find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) warning: librccl1 provides another subpackage: rccl Provides: rccl = 2.18.6-alt0.1, librccl.so.1()(64bit) = set:ldySY8WxOALBnhFpKYr8hTuOp4f4mGu2jLdMJjcZCXM47UXuwyyGRGWXKgETcgdjMi5wuDQ3qOxtZBm81J7pYPMIU1D4ctQkKefUrjndPqhuFfak8KACxDBZ2WZJDfvJzZ89VmVuIkNiinUuRvgP2VSlpiViW0mDiqb8i3YJossrximfgU5FDIg3bfAM3p87RAKcG4MZinBzsSGNgsBCROo9k0v79172vNT21EO938Mcw8Tz6KgGdnaHvvzgmTvhhNQWFQoI4SSRedfYZyMcS4HABqmacW4xzCUZaO5x9LSUxVFl0qy5C7FFGgAn04Hyxww4hPwz6LsL4UDEnEe2dpGZx29zB56rIHYGcZG1BqjQafIX1WE3sbDhXCpfBjMq4 Requires: ld-linux-x86-64.so.2()(64bit) >= set:jiids, ld-linux-x86-64.so.2(GLIBC_2.3)(64bit), libamdhip64.so.6()(64bit) >= set:mgEl4iHah5shPP2z5A5zYttYI7XpZyRnhe1J6ZgwULwPlWeYZ4XbZd2bItRMqeW4hZmmUYmDZdpDnrYqkUKOuzfUwKzIyQItN97gggSsa6v6KYBa3m70aJ49gh1ckMQcuEPMZKgWZw, libamdhip64.so.6(hip_4.2)(64bit), libamdhip64.so.6(hip_4.3)(64bit), libamdhip64.so.6(hip_4.5)(64bit), libamdhip64.so.6(hip_5.0)(64bit), libamdhip64.so.6(hip_5.3)(64bit), libamdhip64.so.6(hip_6.0)(64bit), libc.so.6(GLIBC_2.14)(64bit), libc.so.6(GLIBC_2.17)(64bit), libc.so.6(GLIBC_2.2.5)(64bit), libc.so.6(GLIBC_2.3)(64bit), libc.so.6(GLIBC_2.3.2)(64bit), libc.so.6(GLIBC_2.3.4)(64bit), libc.so.6(GLIBC_2.33)(64bit), libc.so.6(GLIBC_2.34)(64bit), libc.so.6(GLIBC_2.38)(64bit), libc.so.6(GLIBC_2.6)(64bit), libgcc_s.so.1(GCC_3.0)(64bit), libm.so.6(GLIBC_2.2.5)(64bit), librocm_smi64.so.1()(64bit) >= set:miSwa9ZECgdMsH9hGiyEU5mNQ1, libstdc++.so.6(CXXABI_1.3)(64bit), libstdc++.so.6(CXXABI_1.3.5)(64bit), libstdc++.so.6(CXXABI_1.3.7)(64bit), libstdc++.so.6(GLIBCXX_3.4)(64bit), libstdc++.so.6(GLIBCXX_3.4.11)(64bit), libstdc++.so.6(GLIBCXX_3.4.18)(64bit), libstdc++.so.6(GLIBCXX_3.4.19)(64bit), 
libstdc++.so.6(GLIBCXX_3.4.21)(64bit), libstdc++.so.6(GLIBCXX_3.4.22)(64bit), libstdc++.so.6(GLIBCXX_3.4.29)(64bit), rtld(GNU_HASH) Requires(rpmlib): rpmlib(SetVersions) Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.FyU12G Creating librccl1-debuginfo package Processing files: librccl-devel-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.ASG1QV find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.gM97B1 find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) In file included from /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:12: /usr/include/hip/hip_runtime.h:66:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 66 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:70: /usr/include/hip/hip_runtime_api.h:8852:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 8852 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:71: /usr/include/hip/library_types.h:75:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 75 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:73: /usr/include/hip/hip_vector_types.h:38:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 38 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or 
__HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:13: /usr/include/hip/hip_fp16.h:33:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 33 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ cpp.req: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed, trying c++ mode x86_64-alt-linux-cpp: fatal error: cannot execute 'cc1plus': posix_spawnp: No such file or directory compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h:10:10: fatal error: nccl.h: No such file or directory 10 | #include "nccl.h" | ^~~~~~~~ compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h: cpp failed Provides: rccl-devel = 2.18.6-alt0.1 Requires: /usr/lib64/librccl.so.1 Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.9TktXl Processing files: librccl1-debuginfo-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.a8fK7u find-provides: running scripts (debuginfo) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.jbjjyg find-requires: running scripts (debuginfo) Provides: debug64(librccl.so.1) Requires: librccl1 = 2.18.6-alt0.1, debug64(ld-linux-x86-64.so.2), debug64(libamdhip64.so.6), debug64(libc.so.6), debug64(libgcc_s.so.1), debug64(libm.so.6), debug64(librocm_smi64.so.1), debug64(libstdc++.so.6) Adding to librccl1-debuginfo a strict dependency on librccl1 Adding to librccl-devel a strict dependency on librccl1 Removing 1 extra deps from librccl-devel due to dependency on librccl1 Wrote: /usr/src/RPM/RPMS/x86_64/librccl1-2.18.6-alt0.1.x86_64.rpm (w2T8.xzdio) Wrote: /usr/src/RPM/RPMS/x86_64/librccl-devel-2.18.6-alt0.1.x86_64.rpm 
(w2T8.xzdio) Wrote: /usr/src/RPM/RPMS/x86_64/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm (w2.lzdio) 20618.61user 674.37system 46:10.66elapsed 768%CPU (0avgtext+0avgdata 5534188maxresident)k 1318160inputs+0outputs (87436major+78965607minor)pagefaults 0swaps /.out/librccl1-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl-devel-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // 11.43user 3.78system 48:43.87elapsed 0%CPU (0avgtext+0avgdata 138324maxresident)k 2636096inputs+0outputs (0major+311335minor)pagefaults 0swaps --- librccl-devel-2.18.6-alt0.1.x86_64.rpm.repo 2024-08-13 08:56:50.000000000 +0000 +++ librccl-devel-2.18.6-alt0.1.x86_64.rpm.hasher 2025-02-09 06:20:11.968867850 +0000 @@ -45,3 +45,3 @@ File: /usr/lib64/cmake/rccl/rccl-targets-noconfig.cmake 100644 root:root dcd89184125cddb0cdd7a5120e372b92 -File: /usr/lib64/cmake/rccl/rccl-targets.cmake 100644 root:root 1f833abef2648d4339a6b6449612a9d8 +File: /usr/lib64/cmake/rccl/rccl-targets.cmake 100644 root:root 256195dcb68d97fbfd319461c37b5fd3 File: /usr/lib64/librccl.so 120777 root:root librccl.so.1 @@ -72,2 +72,2 @@ File: /usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml 100644 root:root 7aa98e2a1da9874a225b1776b234c9fc -RPMIdentity: d0c9328af4b4795c8b8b443a20328bdf1e45db1f7d495ee0ab628b0f7f5056e5f88afae3c7185a384afe02fb2e07fcc7b6d8ecff3a996ef5a27c764c173c478b +RPMIdentity: cc45d25677dab0b294609a7c92c404b8121093d73e6a84f67e2d7bd1dab17b59f7de30cdfe4bb06c7829daec45af041f06bc99733f4a7f19191211dcc2f23b8c --- librccl1-2.18.6-alt0.1.x86_64.rpm.repo 2024-08-13 08:56:51.000000000 +0000 +++ librccl1-2.18.6-alt0.1.x86_64.rpm.hasher 2025-02-09 06:20:12.056867988 +0000 @@ -43,6 +43,6 @@ Provides: rccl = 2.18.6-alt0.1 -Provides: librccl.so.1()(64bit) = 
set:ldySY8WxOALBnhFpKYr8hTuOp4f4mGu2jLdMJjcZCXM47UXuwyyGRGWXKgETcgdjMi5wuDQ3qOxtZBm81J7pYPMIUZa5VdctQkKefUrjndPqhuFfak8KACxDBZ2WZJDfvJzZ89VmVuIkNiinUuRvWX09AlpiViW0mDiqb8i3YJossrximfgU5FDIg3bfAM3p87RAKcG4MZinBzsSGNgsBCROo9k0v79172vNT21EO938Mcw8TzCb018bhHvvzgmTvhhNQWFQoI4SSRedfYZyMcS4HABqmacW4xzCUZaO5x9LSUxVFl0qy5C7FFGgAn04Hyxww4hPwz6LsL4UDEnEe2dpGZx29zB56rIHYGcZG1BqjQafIX1WE3sbDhXCpfBjMq4 +Provides: librccl.so.1()(64bit) = set:ldySY8WxOALBnhFpKYr8hTuOp4f4mGu2jLdMJjcZCXM47UXuwyyGRGWXKgETcgdjMi5wuDQ3qOxtZBm81J7pYPMIU1D4ctQkKefUrjndPqhuFfak8KACxDBZ2WZJDfvJzZ89VmVuIkNiinUuRvgP2VSlpiViW0mDiqb8i3YJossrximfgU5FDIg3bfAM3p87RAKcG4MZinBzsSGNgsBCROo9k0v79172vNT21EO938Mcw8Tz6KgGdnaHvvzgmTvhhNQWFQoI4SSRedfYZyMcS4HABqmacW4xzCUZaO5x9LSUxVFl0qy5C7FFGgAn04Hyxww4hPwz6LsL4UDEnEe2dpGZx29zB56rIHYGcZG1BqjQafIX1WE3sbDhXCpfBjMq4 Provides: librccl1 = 2.18.6-alt0.1:sisyphus+353658.300.4.1 File: /usr/lib64/librccl.so.1 120777 root:root librccl.so.1.0 -File: /usr/lib64/librccl.so.1.0 100644 root:root 2e5c3ab2b95eede8c321abb70ecf8073 +File: /usr/lib64/librccl.so.1.0 100644 root:root f2ee2c85fe410812a4dace0305289f0e File: /usr/share/doc/librccl1-2.18.6 40755 root:root @@ -52,2 +52,2 @@ File: /usr/share/doc/librccl1-2.18.6/README.md 100644 root:root 7f63560222074951adb129e12c2ea047 -RPMIdentity: e713b9c77db3c8749c70a5cd7ad4ea3e27866bc78e36d700dc50200eab38283472282f7d4b17396a7025a778ee2ab574f956b3a1db80316db878d3e5f095ae6d +RPMIdentity: 76bf94a1b4c817afc320777992ce50be2dddc1ff90ebceced2b2b387781a22cfddcd2cbda1a1f1fc368a3b9cb3fb9922cb534a677b4dca7c377674841ba8cbd7 --- librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm.repo 2024-08-13 08:56:50.000000000 +0000 +++ librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm.hasher 2025-02-09 06:20:12.145868127 +0000 @@ -1,4 +1,4 @@ -/usr/lib/debug/.build-id/06 40755 root:root -/usr/lib/debug/.build-id/06/e0e74b54f4c9a58ada6b6594323d08d8b96adb 120777 root:root ../../../../lib64/librccl.so.1.0 -/usr/lib/debug/.build-id/06/e0e74b54f4c9a58ada6b6594323d08d8b96adb.debug 120777 
root:root ../../usr/lib64/librccl.so.1.0.debug +/usr/lib/debug/.build-id/78 40755 root:root +/usr/lib/debug/.build-id/78/384b19fe4e748c046c9a035ea9790147ef19d2 120777 root:root ../../../../lib64/librccl.so.1.0 +/usr/lib/debug/.build-id/78/384b19fe4e748c046c9a035ea9790147ef19d2.debug 120777 root:root ../../usr/lib64/librccl.so.1.0.debug /usr/lib/debug/usr/lib64/librccl.so.1.0.debug 100644 root:root @@ -300,6 +300,6 @@ Provides: librccl1-debuginfo = 2.18.6-alt0.1:sisyphus+353658.300.4.1 -File: /usr/lib/debug/.build-id/06 40755 root:root -File: /usr/lib/debug/.build-id/06/e0e74b54f4c9a58ada6b6594323d08d8b96adb 120777 root:root ../../../../lib64/librccl.so.1.0 -File: /usr/lib/debug/.build-id/06/e0e74b54f4c9a58ada6b6594323d08d8b96adb.debug 120777 root:root ../../usr/lib64/librccl.so.1.0.debug -File: /usr/lib/debug/usr/lib64/librccl.so.1.0.debug 100644 root:root f8eef68f4c9d90c9baa5adae3fa5e884 +File: /usr/lib/debug/.build-id/78 40755 root:root +File: /usr/lib/debug/.build-id/78/384b19fe4e748c046c9a035ea9790147ef19d2 120777 root:root ../../../../lib64/librccl.so.1.0 +File: /usr/lib/debug/.build-id/78/384b19fe4e748c046c9a035ea9790147ef19d2.debug 120777 root:root ../../usr/lib64/librccl.so.1.0.debug +File: /usr/lib/debug/usr/lib64/librccl.so.1.0.debug 100644 root:root 4aedf9999fabfca1b05f4c8a1f959a1e File: /usr/lib/debug/usr/lib64/librccl.so.1.debug 120777 root:root librccl.so.1.0.debug @@ -589,2 +589,2 @@ File: /usr/src/debug/rccl-2.18.6/x86_64-alt-linux/include/nccl.h 100644 root:root 88c99b744f34dbc0b9c2f53fd9431572 -RPMIdentity: 29d02fa69e031265be273d534b520a374d6107348fa62b1cdfa2b723d006c08d734c8e9950fc8dc604603c646100fa1259f3fc5a5cb1d982dd16a747e0d1d189 +RPMIdentity: 540aa58d173dae14c60ee74c72c33ff84d4add3660a60070ccc66a37ec057057959081da641248f7cb95c184114abdf5ef218141733d2085031036fede44cc9b